cs.LG

Data-Efficient RLVR via Off-Policy Influence Guidance

arXiv:2510.26491v2 Announce Type: replace
Abstract: Data selection is a critical aspect of Reinforcement Learning with Verifiable Rewards (RLVR) for enhancing the reasoning capabilities of large language models (LLMs). Current data selection methods a…

cs.LG

Guided Transfer Learning for Discrete Diffusion Models

arXiv:2512.10877v4 Announce Type: replace
Abstract: Discrete diffusion models (DMs) have achieved strong performance in language and other discrete domains, offering a compelling alternative to autoregressive modeling. Yet this performance typically d…

cs.CV

Geometric Context Transformer for Streaming 3D Reconstruction

arXiv:2604.14141v1 Announce Type: new
Abstract: Streaming 3D reconstruction aims to recover 3D information, such as camera poses and point clouds, from a video stream, which necessitates geometric accuracy, temporal consistency, and computational ef…
