Agentick: A Unified Benchmark for General Sequential Decision-Making Agents
arXiv:2605.06869v2 Announce Type: replace
Abstract: AI agent research spans a wide spectrum: from RL agents that learn from scratch to foundation model agents that leverage pre-trained knowledge, yet no unified benchmark enables fair comparison across…