cs.CL, cs.CV, cs.LG

Mem-W: Latent Memory-Native GUI Agents

arXiv:2605.09317v1 Announce Type: cross
Abstract: GUI agents are beginning to operate the web, mobile, and desktop as interactive worlds, where successful control depends on carrying forward visual, procedural, and task-level evidence beyond the fleet…