cs.AI, cs.CV, eess.IV

Geospatial-Temporal Sensemaking of Remote Sensing Activity Detections with Multimodal Large Language Model

arXiv:2605.10739v1 Announce Type: cross
Abstract: We introduce SMART-HC-VQA, a Sentinel-2-based visual question answering dataset derived from the IARPA SMART Heavy Construction dataset, designed for spatiotemporal analysis of human activity. The data…