cs.AI, cs.CL

Measuring Representation Robustness in Large Language Models for Geometry

arXiv:2604.16421v1 Announce Type: new
Abstract: Large language models (LLMs) are increasingly evaluated on mathematical reasoning, yet their robustness to equivalent problem representations remains poorly understood. In geometry, identical problems ca…