Learning to Generate Formally Verifiable Step-by-Step Logic Reasoning via Structured Formal Intermediaries
arXiv:2603.29500v1
Abstract: Large language models (LLMs) have recently demonstrated impressive performance on complex, multi-step reasoning tasks, especially when post-trained with outcome-rewarded reinforcement learning Guo et a…