Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback
arXiv:2412.02617v2 Announce Type: replace-cross
Abstract: Large text-to-video models hold immense potential for a wide range of downstream applications. However, they struggle to accurately depict dynamic object interactions, often resulting in unreal…