We’ll Fix it in Post: Improving Text-to-Video Generation with Neuro-Symbolic Feedback
arXiv:2504.17180v4 Announce Type: replace
Abstract: Current text-to-video (T2V) generation models are increasingly popular due to their ability to produce coherent videos from textual prompts. However, these models often struggle to generate semantica…