Edit, But Verify: An Empirical Audit of Instructed Code-Editing Benchmarks
arXiv:2604.05100v1 Announce Type: cross
Abstract: Instructed code editing, where an LLM modifies existing code based on a natural language instruction, accounts for roughly 19% of real-world coding assistant interactions. Yet very few benchmarks direc…