PRISM: : Planning and Reasoning with Intent in Simulated Embodied Environments
arXiv:2605.11534v1 Announce Type: new
Abstract: When an LLM-based embodied agent fails at a household task, the culprit could be misidentified objects, forgotten sub-goals, or poor action sequencing — yet existing benchmarks report only a single succ…