ClickAIXR: On-Device Multimodal Vision-Language Interaction with Real-World Objects in Extended Reality
arXiv:2604.04905v1 Announce Type: new
Abstract: We present ClickAIXR, a novel on-device framework for multimodal vision-language interaction with objects in extended reality (XR). Unlike prior systems that rely on cloud-based AI (e.g., ChatGPT) or gaz…