SimBench: Benchmarking the Ability of Large Language Models to Simulate Human Behaviors
arXiv:2510.17516v4 Announce Type: replace
Abstract: Large language model (LLM) simulations of human behavior have the potential to revolutionize the social and behavioral sciences, if and only if they faithfully reflect real human behaviors. Current e…