cs.RO

MotuBrain: An Advanced World Action Model for Robot Control

arXiv:2604.27792v2 Announce Type: replace
Abstract: Vision-Language-Action (VLA) models generalize semantically well but often lack fine-grained modeling of world dynamics. We present MotuBrain, a unified World Action Model that jointly models video a…

cs.CL

Reward Modeling from Natural Language Human Feedback

arXiv:2601.07349v3 Announce Type: replace
Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) on preference data has become the mainstream approach for training Generative Reward Models (GRMs). Typically in pairwise rewarding tasks, GRMs ge…

cs.CL

SCOPE: Planning for Hybrid Querying over Clinical Trial Data

arXiv:2604.25120v2 Announce Type: replace
Abstract: We study clinical trial table reasoning, where answers are not directly stored in visible cells but must be reasoned from semantic understanding through normalization, classification, extraction, or …
