Math Takes Two: A test for emergent mathematical reasoning in communication
arXiv:2604.21935v1 Announce Type: new
Abstract: Although language models demonstrate remarkable proficiency on mathematical benchmarks, it remains unclear whether this reflects true mathematical reasoning or statistical pattern matching over learning …