cs.AI, cs.CV

Moondream Segmentation: From Words to Masks

arXiv:2604.02593v1 Announce Type: new
Abstract: We present Moondream Segmentation, a referring image segmentation extension of Moondream 3, a vision-language model. Given an image and a referring expression, the model autoregressively decodes a vector…