MemGround: Long-Term Memory Evaluation Kit for Large Language Models in Gamified Scenarios
arXiv:2604.14158v1 Announce Type: new
Abstract: Current evaluations of long-term memory in LLMs are fundamentally static. By fixating on simple retrieval and short-context inference, they neglect the multifaceted nature of complex memory systems, such…