CreativityBench: Evaluating Agent Creative Reasoning via Affordance-Based Tool Repurposing
arXiv:2605.02910v1 Announce Type: cross
Abstract: Recent advances in large language models have led to strong performance on reasoning and environment-interaction tasks, yet their ability for creative problem-solving remains underexplored. We study th…