Scene Graph and Dependency Grammar Enhanced Remote Sensing Change Caption Network (SGD-RSCCN)

Qiaoli Sun, Yan Wang, Xiaoyu Song


Abstract
With the continuous advancement of remote sensing technology, it is easier to obtain high-resolution, multi-temporal and multi-spectral images. The images carry rich information of ground objects. However, how to effectively extract useful information from the complex image data and convert it into understandable semantic descriptions remains a challenge. To deal with the challenges, we propose a Scene Graph and Dependency Grammar Enhanced Remote Sensing Change Caption Network (SGD-RSCCN) to improve the accuracy and naturalness of extracting and describing change information from remote sensing images. By combining advanced visual analysis technology and natural language processing technology, the network not only optimizes the problem of insufficient understanding of complex scenes, but also enhances the ability to capture dynamic changes, thereby generating more accurate and smooth natural language description. In addition, we also proposes the decoder based on prior knowledge, which further improves the readability and comprehensibility of the description. Extensive experiments on LEVIR-CC and Dubai-CC datasets verify the advantages of the proposed method in generating accurate and true descriptions.
Anthology ID:
2025.coling-main.144
Volume:
Proceedings of the 31st International Conference on Computational Linguistics
Month:
January
Year:
2025
Address:
Abu Dhabi, UAE
Editors:
Owen Rambow, Leo Wanner, Marianna Apidianaki, Hend Al-Khalifa, Barbara Di Eugenio, Steven Schockaert
Venue:
COLING
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
2121–2130
Language:
URL:
https://aclanthology.org/2025.coling-main.144/
DOI:
Bibkey:
Cite (ACL):
Qiaoli Sun, Yan Wang, and Xiaoyu Song. 2025. Scene Graph and Dependency Grammar Enhanced Remote Sensing Change Caption Network (SGD-RSCCN). In Proceedings of the 31st International Conference on Computational Linguistics, pages 2121–2130, Abu Dhabi, UAE. Association for Computational Linguistics.
Cite (Informal):
Scene Graph and Dependency Grammar Enhanced Remote Sensing Change Caption Network (SGD-RSCCN) (Sun et al., COLING 2025)
Copy Citation:
PDF:
https://aclanthology.org/2025.coling-main.144.pdf