cs.AI, cs.LG

Leveraging Human Feedback for Semantically-Relevant Skill Discovery

arXiv:2604.24127v1 Announce Type: cross
Abstract: Unsupervised skill discovery in reinforcement learning aims to intrinsically motivate agents to discover diverse and useful behaviours. However, unconstrained approaches can produce unsafe, unethical, …