cs.CL, cs.LG

Less Is More: Cognitive Load and the Single-Prompt Ceiling in LLM Mathematical Reasoning

arXiv:2604.18897v1 Announce Type: cross
Abstract: We present a systematic empirical study of prompt engineering for formal mathematical reasoning in the context of the SAIR Equational Theories Stage 1 competition. The task requires deciding whether on…