cs.AI, cs.CL, cs.CY, cs.LG

SimBench: Benchmarking the Ability of Large Language Models to Simulate Human Behaviors

arXiv:2510.17516v4 Announce Type: replace
Abstract: Large language model (LLM) simulations of human behavior have the potential to revolutionize the social and behavioral sciences, if and only if they faithfully reflect real human behaviors. Current e…