FoleyDirector: Fine-Grained Temporal Steering for Video-to-Audio Generation via Structured Scripts
arXiv:2603.19857v2 Announce Type: replace-cross
Abstract: Recent Video-to-Audio (V2A) methods have achieved remarkable progress, enabling the synthesis of realistic, high-quality audio. However, they struggle with fine-grained temporal control in mult…