Enhancing Physical Plausibility in Video Generation by Reasoning the Implausibility
arXiv:2509.24702v2 Announce Type: replace
Abstract: Diffusion models can generate realistic videos, but existing methods rely on implicitly learning physical reasoning from large-scale text-video datasets, which is costly, difficult to scale, and stil…