cs.AI, cs.LG

Beyond Noisy-TVs: Noise-Robust Exploration Via Learning Progress Monitoring

arXiv:2509.25438v2 Announce Type: replace
Abstract: When the environment contains an unlearnable source of randomness (a noisy TV), an exploring agent driven naively by intrinsic reward gets stuck at that source and fails to explore…
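
The failure mode described in the abstract can be illustrated with a toy sketch. This is not the paper's algorithm: it assumes a scalar running-mean predictor, uses squared prediction error as the naive intrinsic reward, and uses the drop in mean error between two consecutive windows as a stand-in for a learning-progress signal. The names `run`, `learnable`, and `noisy_tv` are hypothetical.

```python
import random


def run(source, steps=2000, lr=0.05, window=200, seed=0):
    """Fit a running-mean predictor to a scalar source.

    Returns (recent_error, progress), where recent_error is the mean squared
    prediction error over the last window and progress is the drop in that
    mean relative to the window before it.
    """
    rng = random.Random(seed)
    pred = 0.0
    errors = []
    for _ in range(steps):
        target = source(rng)
        errors.append((target - pred) ** 2)  # naive intrinsic reward
        pred += lr * (target - pred)         # simple online update
    older = sum(errors[-2 * window:-window]) / window
    recent = sum(errors[-window:]) / window
    return recent, older - recent


def learnable(rng):
    # Deterministic source: the predictor can drive its error to zero.
    return 1.0


def noisy_tv(rng):
    # Unlearnable source: fresh uniform noise on every step.
    return rng.uniform(-1.0, 1.0)


learnable_err, learnable_lp = run(learnable)
noisy_err, noisy_lp = run(noisy_tv)

print(f"learnable: error={learnable_err:.4f} progress={learnable_lp:+.4f}")
print(f"noisy TV:  error={noisy_err:.4f} progress={noisy_lp:+.4f}")
```

Under these assumptions, the error-based reward on the noisy source stays well above zero however long the predictor trains, so an agent maximizing it keeps returning there. The windowed progress signal only fluctuates around zero for that source, which is the intuition behind monitoring learning progress instead of raw error.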