SWE Context Bench: A Benchmark for Context Learning in Coding
arXiv:2602.08316v2 Announce Type: replace-cross
Abstract: Large language models are increasingly used as programming agents for repository level software engineering tasks. While recent benchmarks evaluate correctness in realistic codebases, they larg…