cs.AI, cs.CL, cs.LG

SE-Bench: Benchmarking Self-Evolution with Knowledge Internalization

arXiv:2602.04811v2 Announce Type: replace-cross
Abstract: True self-evolution requires agents to act as lifelong learners that internalize novel experiences to solve future problems. However, rigorously measuring this foundational capability is hinder…