Mohamed Aghzal, Gregory J. Stein, Ziyu Yao

Why Do LLM-based Web Agents Fail? A Hierarchical Planning Perspective

Mohamed Aghzal, Gregory J. Stein, Ziyu Yao / April 29, 2026

arXiv:2603.14248v2
Abstract: Large language model (LLM) web agents are increasingly used for web navigation but remain far from human reliability on realistic, long-horizon tasks. Existing evaluations focus primarily on en…
