cs.CV

Multi-modal, multi-scale representation learning for satellite imagery analysis just needs a good ALiBi

arXiv:2604.10347v1 Announce Type: new
Abstract: Vision foundation models have been shown to be effective at processing satellite imagery into representations fit for downstream tasks, however, creating models which operate over multiple spatial resolu…