cs.CL, cs.ET, cs.HC, cs.IR

From Speech-to-Spatial: Grounding Utterances on A Live Shared View with Augmented Reality

arXiv:2602.03059v2 Announce Type: replace-cross
Abstract: We introduce Speech-to-Spatial, a referent disambiguation framework that converts verbal remote-assistance instructions into spatially grounded AR guidance. Unlike prior systems that rely on ad…