Tool Learning Needs Nothing More Than a Free 8B Language Model
arXiv:2604.17739v1 Announce Type: cross
Abstract: Reinforcement learning (RL) has become a prevalent paradigm for training tool calling agents, which typically requires online interactive environments. Existing approaches either rely on training data …