Detecting Media Clones in Cultural Repositories Using a Positive Unlabeled Learning Approach
arXiv:2604.04071v1 Announce Type: new
Abstract: We formulate curator-in-the-loop duplicate discovery in the AtticPOT repository as a Positive-Unlabeled (PU) learning problem. Given a single anchor per artefact, we train a lightweight per-query Clone E…
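To make the Positive-Unlabeled framing concrete, here is a minimal sketch of per-query ranking from a single anchor. It is not the method from the paper, whose details are cut off in the abstract above. It is a naive PU baseline that provisionally treats every unlabeled item as a negative, and it assumes each item is already represented as a fixed-length feature vector. The function name `rank_clone_candidates` and all of its parameters are hypothetical.

```python
import numpy as np


def rank_clone_candidates(anchor, pool, steps=200, lr=0.5, l2=1e-2):
    """Rank unlabeled pool items by how anchor-like a per-query model finds them.

    Naive PU baseline (an assumption, not the paper's method): the anchor is
    the only labeled positive, and every pool item is provisionally treated
    as a negative. The two classes get equal total weight so that the single
    positive is not swamped by the unlabeled set.
    """
    anchor = np.asarray(anchor, dtype=float)
    pool = np.asarray(pool, dtype=float)

    # Stack the anchor on top of the unlabeled pool.
    X = np.vstack([anchor[None, :], pool])
    y = np.concatenate([[1.0], np.zeros(len(pool))])

    # Class-balanced sample weights: 0.5 for the anchor, 0.5 shared by the pool.
    sample_w = np.where(y == 1.0, 0.5, 0.5 / len(pool))

    # Weighted, L2-regularised logistic regression fitted by gradient descent.
    w = np.zeros(X.shape[1])
    b = 0.0
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-(X @ w + b)))
        err = sample_w * (p - y)
        w -= lr * (X.T @ err + l2 * w)
        b -= lr * err.sum()

    # Score only the unlabeled items and sort from most to least anchor-like.
    scores = 1.0 / (1.0 + np.exp(-(pool @ w + b)))
    order = np.argsort(-scores)
    return order, scores
```

A fresh model is fitted for each query anchor, and the top of the returned ordering is what a curator would review first. Because true clones in the pool are also treated as negatives during training, the scores are useful for ranking only and should not be read as calibrated probabilities.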