cs.AI, cs.CL, cs.CR

LATTICE: Evaluating Decision Support Utility of Crypto Agents

arXiv:2604.26235v1 Announce Type: cross
Abstract: We introduce LATTICE, a benchmark for evaluating the decision support utility of crypto agents in realistic user-facing scenarios. Prior crypto agent benchmarks mainly focus on reasoning-based or outco…