MSG Score: Automated Video Verification for Reliable Multi-Scene Generation
arXiv:2411.19121v2 Announce Type: replace-cross
Abstract: While text-to-video diffusion models have advanced significantly, creating coherent long-form content remains unreliable due to stochastic sampling artifacts. This necessitates generating multi…