Swadhin Das, Vivek Yadav

JSSFF: A Joint Structural-Semantic Fusion Framework for Remote Sensing Image Captioning

Swadhin Das, Vivek Yadav / April 28, 2026

arXiv:2604.24031v1 Announce Type: new
Abstract: The encoder-decoder framework has become widely popular nowadays. In this model, the encoder extracts informative visual features from an input image, and the decoder employs a sequence-to-sequence formu…

Author name: Swadhin Das, Vivek Yadav

JSSFF: A Joint Structural-Semantic Fusion Framework for Remote Sensing Image Captioning