Visual Latents Know More Than They Say: Unsilencing Latent Reasoning in MLLMs
arXiv:2605.02735v1 Announce Type: new
Abstract: Continuous latent-space reasoning offers a compact alternative to textual chain-of-thought for multimodal models, enabling high-dimensional visual evidence to be integrated without explicit reasoning tok…