Learning to Retrieve from Agent Trajectories
arXiv:2604.04949v1 Announce Type: cross
Abstract: Information retrieval (IR) systems have traditionally been designed and trained for human users, with learning-to-rank methods relying heavily on large-scale human interaction logs such as clicks and d…