Another BRIXEL in the Wall: Towards Cheaper Dense Features
arXiv:2511.05168v2 Announce Type: replace-cross
Abstract: Vision foundation models achieve strong performance on both global and locally dense downstream tasks. Pretrained on large images, the recent DINOv3 model family can produce very fine-gr…