cs.CV, cs.MM, cs.SD

TurboTalk: Progressive Distillation for One-Step Audio-Driven Talking Avatar Generation

arXiv:2604.14580v1 Announce Type: new
Abstract: Existing audio-driven video digital human generation models rely on multi-step denoising, resulting in substantial computational overhead that severely limits their deployment in real-world settings. Whi…