Physics-Informed Causal MDPs for Sequential Constraint Repair in Engineering Simulation Pipelines
arXiv:2604.17910v1 Announce Type: cross
Abstract: Off-policy learning in constrained MDPs with large binary state spaces faces a fundamental tension: causal identification of transition dynamics requires structural assumptions, while sample-efficient …