Dual Tuning for Reasoning Efficacy-Driven Data Curation in Multimodal LLM Training
arXiv:2603.04415v2 Announce Type: replace
Abstract: Reasoning post-training improves Large Language Models (LLMs) on complex tasks such as mathematics and coding, but its benefits across diverse multimodal tasks remain uncertain. The trend of releasi…