Tempered Sequential Monte Carlo for Trajectory and Policy Optimization with Differentiable Dynamics
arXiv:2604.21456v1 Announce Type: cross
Abstract: We propose a sampling-based framework for finite-horizon trajectory and policy optimization under differentiable dynamics by casting controller design as inference. Specifically, we minimize a KL-regul…