cs.CL, cs.LG

MEME: Multi-entity & Evolving Memory Evaluation

arXiv:2605.12477v1 Announce Type: cross
Abstract: LLM-based agents increasingly operate in persistent environments where they must store, update, and reason over information across many sessions. While prior benchmarks evaluate only single-entity upda…