cs.AI, cs.SE

Needle in the Repo: A Benchmark for Maintainability in AI-Generated Repository Edits

arXiv:2603.27745v1 Announce Type: cross
Abstract: AI coding agents can now complete complex programming tasks, but existing evaluations largely emphasize behavioral correctness and often overlook maintainability risks such as weak modularity or testab…