Training-Time Batch Normalization Reshapes Local Partition Geometry in Piecewise-Affine Networks
arXiv:2605.04946v1 Announce Type: cross
Abstract: Batch normalization (BN) is central to modern deep networks, but its effect on the realized function during training remains less well understood than its optimization benefits. We study training-time BN in…