Integrating Causal DAGs in Deep RL: Activating Minimal Markovian States with Multi-Order Exposure
arXiv:2605.07057v1 Announce Type: new
Abstract: Online reinforcement learning (RL) relies on the Markov property for guaranteed performance, but real-world applications often lack well-defined states given raw observed variables. While causal RL has a…