EntityBench: Towards Entity-Consistent Long-Range Multi-Shot Video Generation
arXiv:2605.15199v1 Announce Type: cross
Abstract: Multi-shot video generation extends single-shot generation to coherent visual narratives, yet maintaining consistent characters, objects, and locations across shots remains a challenge over long sequen…