Beyond Point Estimates: Distributional Uncertainty in Machine Learning Performance Evaluation
arXiv:2501.16931v2 Announce Type: replace
Abstract: Machine learning models are often evaluated using point estimates of performance metrics such as accuracy, F1 score, or mean squared error. Such summaries fail to capture the inherent variability ind…