cs.CL, hep-lat, hep-ph, physics.comp-ph, physics.optics

PRBench: End-to-end Paper Reproduction in Physics Research

arXiv:2603.27646v1 Announce Type: new
Abstract: AI agents powered by large language models exhibit strong reasoning and problem-solving capabilities, enabling them to assist scientific research tasks such as formula derivation and code generation. How…