MemEvoBench: Benchmarking Memory MisEvolution in LLM Agents
arXiv:2604.15774v1 Announce Type: new
Abstract: Equipping Large Language Models (LLMs) with persistent memory enhances interaction continuity and personalization but introduces new safety risks. Specifically, contaminated or biased memory accumulation…