cs.CV, cs.MM, cs.SD

PAVAS: Physics-Aware Video-to-Audio Synthesis

arXiv:2512.08282v2 Announce Type: replace
Abstract: Recent advances in Video-to-Audio (V2A) generation have achieved impressive perceptual quality and temporal synchronization, yet most models remain appearance-driven, capturing visual-acoustic correl…