Real-Time Visual Attribution Streaming in Thinking Model
arXiv:2604.16587v1 Announce Type: new
Abstract: We present an amortized framework for real-time visual attribution streaming in multimodal thinking models. When these models generate code from a screenshot or solve math problems from images, their lon…