cs.CV

Frequency-guided Multi-level Reasoning for Scene Graph Generation in Video

arXiv:2604.17298v1 Announce Type: new
Abstract: Video Scene Graph Generation aims to obtain structured semantic representations of objects and their relationships in videos for high-level understanding. However, existing methods still have limitations…