CoEvolve: Training LLM Agents via Agent-Data Mutual Evolution
arXiv:2604.15840v1 Announce Type: new
Abstract: Reinforcement learning for LLM agents is typically conducted on a static data distribution, which fails to adapt to the agent’s evolving behavior and leads to poor coverage of complex environment interac…