cs.AI, cs.LG

Android Coach: Improve Online Agentic Training Efficiency with Single State Multiple Actions

arXiv:2604.07277v1 Announce Type: cross
Abstract: Online reinforcement learning (RL) is an effective way to enhance the capabilities of Android agents. However, training agents through online interaction is prohibitively expensive…