cs.LG, stat.ME, stat.ML

Nearly Optimal Subdata Selection

arXiv:2604.23930v1 Announce Type: cross
Abstract: When, in terms of the number of data points, the size of a dataset exceeds available computing resources, or when labeling is expensive, an attractive solution consists of selecting only some of the da…