GroupMemBench: Benchmarking LLM Agent Memory in Multi-Party Conversations
arXiv:2605.14498v1 Announce Type: new
Abstract: Large Language Model (LLM) agents increasingly serve as personal assistants and workplace collaborators, where their utility depends on memory systems that extract, retrieve, and apply information across…