Learning to Route Queries to Heads for Attention-based Re-ranking with Large Language Models
arXiv:2604.24608v1 Announce Type: cross
Abstract: Large Language Models (LLMs) have recently been explored as fine-grained zero-shot re-rankers by leveraging attention signals to estimate document relevance. However, existing methods either aggregate …