cs.CL

Decoupling the Effect of Chain-of-Thought Reasoning: A Human Label Variation Perspective

arXiv:2601.03154v2 Announce Type: replace
Abstract: Reasoning-tuned LLMs utilizing long Chain-of-Thought (CoT) excel at single-answer tasks, yet their ability to model Human Label Variation–which requires capturing probabilistic ambiguity rather than…