cs.LG

Reward Shaping and Action Masking for Compositional Tasks using Behavior Trees and LLMs

arXiv:2605.05795v1 Announce Type: new
Abstract: Decomposing complex tasks into a sequence of simpler subtasks can improve learning efficiency for an autonomous agent. Reinforcement learning (RL) can be used to optimize agent policies to complete subta…