cs.CV

Tokenizing Semantic Segmentation with Run Length Encoding

arXiv:2602.21627v3 Announce Type: replace
Abstract: This paper presents a new unified approach to semantic segmentation in both images and videos by using language modeling to output the masks as sequences of discrete tokens. We use run length encodin…