Transferable Delay-Aware Reinforcement Learning via Implicit Causal Graph Modeling
arXiv:2605.12312v1 Announce Type: new
Abstract: Random delays weaken the temporal correspondence between actions and subsequent state feedback, making it difficult for agents to identify the true propagation process of action effects. In cross-task sc…