cs.CL

MoCo: A One-Stop Shop for Model Collaboration Research

arXiv:2601.21257v2 Announce Type: replace
Abstract: Advancing beyond single monolithic language models (LMs), recent research increasingly recognizes the importance of model collaboration, where multiple LMs collaborate, compose, and complement each o…

cs.AI, cs.LG

Bounded Ratio Reinforcement Learning

arXiv:2604.18578v1 Announce Type: new
Abstract: Proximal Policy Optimization (PPO) has become the predominant algorithm for on-policy reinforcement learning due to its scalability and empirical robustness across domains. However, there is a significan…

cs.CV

WildDet3D: Scaling Promptable 3D Detection in the Wild

arXiv:2604.08626v2 Announce Type: replace
Abstract: Understanding objects in 3D from a single image is a cornerstone of spatial intelligence. A key step toward this goal is monocular 3D object detection–recovering the extent, location, and orientatio…

cs.CV, cs.LG

Vision Language Models are Biased

arXiv:2505.23941v4 Announce Type: replace
Abstract: Large language models (LLMs) memorize a vast amount of prior knowledge from the Internet that helps them on downstream tasks but also may notoriously sway their outputs towards wrong or biased answer…

Scroll to Top