cs.CV, cs.SD

UniLS: End-to-End Audio-Driven Avatars for Unified Listening and Speaking

arXiv:2512.09327v2 Announce Type: replace
Abstract: Generating lifelike conversational avatars requires modeling not just isolated speakers, but the dynamic, reciprocal interaction of speaking and listening. However, modeling the listener is exception…