cs.AI

Faithful Mobile GUI Agents with Guided Advantage Estimator

arXiv:2605.01208v1 Announce Type: new
Abstract: Vision-language model based graphical user interface (GUI) agents have shown strong interaction capabilities. However, they often behave unfaithfully, relying on memorized shortcuts rather than grounding…