cs.LG, math.AP, stat.ML

Two-Time-Scale Learning Dynamics: A Population View of Neural Network Training

arXiv:2603.19808v2 Announce Type: replace
Abstract: Population-based learning paradigms, including evolutionary strategies, Population-Based Training (PBT), and recent model-merging methods, combine fast within-model optimisation with slower populatio…