Kihyun Yu, Seoungbin Bae, Dabeen Lee

Near-Optimal Primal-Dual Algorithm for Learning Linear Mixture CMDPs with Adversarial Rewards

Kihyun Yu, Seoungbin Bae, Dabeen Lee / March 31, 2026

arXiv:2603.27884v1 Announce Type: new
Abstract: We study safe reinforcement learning in finite-horizon linear mixture constrained Markov decision processes (CMDPs) with adversarial rewards under full-information feedback and an unknown transition kern…

Author name: Kihyun Yu, Seoungbin Bae, Dabeen Lee

Near-Optimal Primal-Dual Algorithm for Learning Linear Mixture CMDPs with Adversarial Rewards