cs.LG, math.OC

Adjoint Matching through the Lens of the Stochastic Maximum Principle in Optimal Control

arXiv:2604.08580v1 Announce Type: cross
Abstract: Reward fine-tuning of diffusion and flow models and sampling from tilted or Boltzmann distributions can both be formulated as stochastic optimal control (SOC) problems, where learning an optimal genera…