Training-free Motion Factorization for Compositional Video Generation
arXiv:2603.09104v2 Announce Type: replace
Abstract: Compositional video generation aims to synthesize multiple instances with diverse appearance and motion. However, current approaches mainly focus on binding semantics, neglecting to understand divers…