This paper proposes a novel task for UAV scene understanding – UAV Scene Change Captioning (UAV-SCC) – which aims to generate natural language descriptions of semantic changes in dynamic aerial imagery captured from a movable viewpoint. Unlike traditional change captioning that mainly describes differences between image pairs captured from a fixed camera viewpoint over time, UAV scene change captioning focuses on image-pair differences resulting from both temporal and spatial scene variations dynamically captured by a moving camera.
Pour en savoir plus : Hierarchical Dual-Change Collaborative Learning for UAV Scene Change Captioning