ChemPro: A Progressive Chemistry Benchmark for Large Language Models
arXiv:2602.03108v4 Announce Type: replace
Abstract: We introduce ChemPro, a progressive benchmark with 4100 natural language question-answer pairs in Chemistry, across 4 coherent sections of difficulty designed to assess the proficiency of Large Langu…