cs.AI, cs.LG, cs.SE

OpenClassGen: A Large-Scale Corpus of Real-World Python Classes for LLM Research

arXiv:2504.15564v3 Announce Type: replace-cross
Abstract: Existing class-level code generation datasets are either synthetic (ClassEval: 100 classes) or insufficient in scale for modern training needs (RealClassEval: 400 classes), hindering robust eva…