cs.AI, cs.CV, cs.LG

TRIP-Evaluate: An Open Multimodal Benchmark for Evaluating Large Models in Transportation

arXiv:2605.00907v1 Announce Type: cross
Abstract: Large language models (LLMs) and multimodal large models (MLLMs) are increasingly used for transportation tasks such as regulation question answering, traffic management support, engineering review, an…