cs.LG, cs.NE, cs.RO

Distributional Value Estimation Without Target Networks for Robust Quality-Diversity

arXiv:2604.20381v1 Announce Type: new
Abstract: Quality-Diversity (QD) algorithms excel at discovering diverse repertoires of skills, but are hindered by poor sample efficiency and often require tens of millions of environment steps to solve complex l…