TopoPrune: Robust Data Pruning via Unified Latent Space Topology
arXiv:2602.02739v2 Announce Type: replace
Abstract: Geometric data pruning methods, while practical for leveraging pretrained models, are fundamentally unstable. Their reliance on extrinsic geometry renders them highly sensitive to latent space pertur…