Visual Enhanced Depth Scaling for Multimodal Latent Reasoning
arXiv:2604.10500v2 Announce Type: replace
Abstract: Multimodal latent reasoning has emerged as a promising paradigm that replaces explicit Chain-of-Thought (CoT) decoding with implicit feature propagation, simultaneously enhancing representation infor…