cs.CV

GoClick: Lightweight Element Grounding Model for Autonomous GUI Interaction

arXiv:2604.23941v1 Announce Type: new
Abstract: Graphical User Interface (GUI) element grounding (precisely locating elements on screenshots based on natural language instructions) is fundamental for agents interacting with GUIs. Deploying this capabi…