Pratik Deshmukh, Atirek Gupta

On Semantic Loss Fine-Tuning Approach for Preventing Model Collapse in Causal Reasoning

Pratik Deshmukh, Atirek Gupta / May 8, 2026

arXiv:2605.05438v1 Announce Type: new
Abstract: Standard fine-tuning of transformer models on causal reasoning tasks leads to catastrophic model collapse, where models learn trivial solutions such as always predicting “Yes” or “No” regardless of input…

Author name: Pratik Deshmukh, Atirek Gupta

On Semantic Loss Fine-Tuning Approach for Preventing Model Collapse in Causal Reasoning