Qi Zhang, Dawei Wang, Shaofeng Zou

Step-level Denoising-time Diffusion Alignment with Multiple Objectives

Qi Zhang, Dawei Wang, Shaofeng Zou / April 17, 2026

arXiv:2604.14379v1 Announce Type: new
Abstract: Reinforcement learning (RL) has emerged as a powerful tool for aligning diffusion models with human preferences, typically by optimizing a single reward function under a KL regularization constraint. In …

Author name: Qi Zhang, Dawei Wang, Shaofeng Zou

Step-level Denoising-time Diffusion Alignment with Multiple Objectives