cs.AI, cs.CL

Why Do LLM-based Web Agents Fail? A Hierarchical Planning Perspective

arXiv:2603.14248v2 Announce Type: replace-cross
Abstract: Large language model (LLM) web agents are increasingly used for web navigation but remain far from human reliability on realistic, long-horizon tasks. Existing evaluations focus primarily on en…