CASP: Support-Aware Offline Policy Selection for Two-Stage Recommender Systems
arXiv:2604.23022v1 Announce Type: cross
Abstract: Two-stage recommender systems first choose a candidate generator and then rank items within the generated set. Because the generator decides which items are available to the ranker, changing the genera…