SciIntegrity-Bench: A Benchmark for Evaluating Academic Integrity in AI Scientist Systems
arXiv:2605.10246v1 Announce Type: new
Abstract: AI scientist systems are increasingly deployed for autonomous research, yet their academic integrity has never been systematically evaluated. We introduce SciIntegrity-Bench, the first benchmark designed…