PolySLGen: Online Multimodal Speaking-Listening Reaction Generation in Polyadic Interaction
arXiv:2604.08125v2 Announce Type: replace
Abstract: Human-like multimodal reaction generation is essential for natural group interactions between humans and embodied AI. However, existing approaches are limited to single-modality or speaking-only resp…