Adaptive Symmetrization of the KL Divergence
arXiv:2511.11159v2 Announce Type: replace
Abstract: Many tasks in machine learning can be described as or reduced to learning a probability distribution given a finite set of samples. A common approach is to minimize a statistical divergence between t…