cs.DC, cs.LG, math.OC, stat.ML

Rennala MVR: Improved Time Complexity for Parallel Stochastic Optimization via Momentum-Based Variance Reduction

arXiv:2605.08871v1 Announce Type: cross
Abstract: Large-scale machine learning models are trained on clusters of machines that exhibit heterogeneous performance due to hardware variability, network delays, and system-level instabilities. In such envir…