Robust stochastic first order methods in heavy-tailed noise via medoid mini-batch gradient sampling
arXiv:2605.07634v1 Announce Type: cross
Abstract: We consider a first-order stochastic optimization framework where, at each iteration, $K$ independent and identically distributed (i.i.d.) data samples are drawn, based on which stochastic gradients …
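
As a rough illustration of the idea named in the title, the following is a minimal sketch of picking the medoid of $K$ stochastic gradients and using it in a plain gradient step. The abstract is truncated here, so the details are assumptions rather than the paper's actual method: the function names `medoid_gradient` and `medoid_sgd_step`, the Euclidean distance, the first-index tie-breaking, and the constant step size are all hypothetical choices made for the example.

```python
import numpy as np


def medoid_gradient(grads):
    """Return the medoid of K gradient vectors.

    `grads` has shape (K, d). The medoid is the row whose summed
    Euclidean distance to all other rows is smallest (assumed metric).
    Ties are broken by taking the first such row (assumed rule).
    """
    grads = np.asarray(grads, dtype=float)
    # Pairwise differences, shape (K, K, d), then pairwise distances (K, K).
    diffs = grads[:, None, :] - grads[None, :, :]
    dists = np.linalg.norm(diffs, axis=-1)
    idx = int(np.argmin(dists.sum(axis=1)))
    return grads[idx]


def medoid_sgd_step(x, sample_grad, K, step_size, rng):
    """One hypothetical update: draw K stochastic gradients at x,
    select their medoid, and take a gradient step along it.

    `sample_grad(x, rng)` is a user-supplied (hypothetical) oracle that
    returns one stochastic gradient of shape (d,).
    """
    grads = np.stack([sample_grad(x, rng) for _ in range(K)])
    return x - step_size * medoid_gradient(grads)


if __name__ == "__main__":
    # Toy quadratic f(x) = 0.5 * ||x||^2 with heavy-tailed (Student-t) noise.
    rng = np.random.default_rng(0)

    def noisy_grad(x, rng):
        return x + rng.standard_t(df=1.5, size=x.shape)

    x = np.ones(5)
    for _ in range(200):
        x = medoid_sgd_step(x, noisy_grad, K=7, step_size=0.05, rng=rng)
    print(np.linalg.norm(x))
```

Because the medoid is always one of the sampled gradients, a single extreme sample cannot drag the update direction the way it can with a mini-batch mean; the pairwise-distance computation costs $O(K^2 d)$ per iteration in this naive form.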