Efficient Multivector Retrieval with Token-Aware Clustering and Hierarchical Indexing
arXiv:2604.28142v1 Announce Type: cross
Abstract: Multivector retrieval models achieve state-of-the-art effectiveness through fine-grained token-level representations, but their deployment incurs substantial computational and memory costs. Current sol…