cs.CV

MOSA: Motion-Guided Semantic Alignment for Dynamic Scene Graph Generation

arXiv:2604.19631v1 Announce Type: new
Abstract: Dynamic Scene Graph Generation (DSGG) aims to structurally model objects and their dynamic interactions in video sequences for high-level semantic understanding. However, existing methods struggle with f…