MGA: Memory-Driven GUI Agent for Observation-Centric Interaction
arXiv:2510.24168v2 Announce Type: replace
Abstract: Multimodal Large Language Models (MLLMs) have significantly advanced GUI agents, yet long-horizon automation remains constrained by two critical bottlenecks: context overload from raw sequential traj…