Driving Intents Amplify Planning-Oriented Reinforcement Learning

/ May 15, 2026

arXiv:2605.12625v2 Announce Type: replace
Abstract: Continuous-action policies trained on a single demonstrated trajectory per scene suffer from mode collapse: samples cluster around the demonstrated maneuver and the policy cannot represent semantical…

cs.CR, cs.CV, cs.LG, cs.RO

Systematic Discovery of Semantic Attacks in Online Map Construction through Conditional Diffusion

/ May 15, 2026

arXiv:2605.14396v1 Announce Type: cross
Abstract: Autonomous vehicles depend on online HD map construction to perceive lane boundaries, dividers, and pedestrian crossings — safety-critical road elements that directly govern motion planning. While exi…

cs.CV, cs.RO

Co-Me: Confidence-Guided Token Merging for Visual Geometric Transformers

/ May 15, 2026

arXiv:2511.14751v2 Announce Type: replace-cross
Abstract: We propose Confidence-Guided Token Merging (Co-Me), an acceleration mechanism for visual geometric transformers without retraining or finetuning the base model. Co-Me distills a light-weight c…

cs.DC, cs.GR, cs.NA, cs.RO, math.NA

DiffPhD: A Unified Differentiable Solver for Projective Heterogeneous Materials in Elastodynamics with Contact-Rich GPU-Acceleration

/ May 15, 2026

arXiv:2605.14526v1 Announce Type: cross
Abstract: Differentiable simulation of soft bodies is a foundation for system identification, trajectory optimization, and Real2Sim transfer. Yet, existing methods such as the differentiable Projective Dynamics …

cs.CL, cs.HC

A Formative Study of Brief Affective Text as a Complement to Wearable Sensing for Longitudinal Student Health Monitoring

/ May 15, 2026

arXiv:2605.14360v1 Announce Type: cross
Abstract: Wearable devices capture physiological and behavioral data with increasing fidelity, but the psychological context shaping these outcomes is difficult to recover from sensor data alone, limiting passiv…

cs.LG

BiTrajDiff: Bidirectional Trajectory Generation with Diffusion Models for Offline Reinforcement Learning

/ May 15, 2026

arXiv:2506.05762v5 Announce Type: replace
Abstract: Recent advances in offline Reinforcement Learning (RL) have proven that effective policy learning can benefit from imposing conservative constraints on pre-collected datasets. However, such static da…

cs.AI, cs.RO

D-VLA: A High-Concurrency Distributed Asynchronous Reinforcement Learning Framework for Vision-Language-Action Models

/ May 15, 2026

arXiv:2605.13276v2 Announce Type: replace-cross
Abstract: The rapid evolution of Embodied AI has enabled Vision-Language-Action (VLA) models to excel in multimodal perception and task execution. However, applying Reinforcement Learning (RL) to these m…

cs.LG

DeepTokenEEG: Enhancing Mild Cognitive Impairment and Alzheimer's Classification via Tokenized EEG Features

/ May 15, 2026

arXiv:2605.15009v1 Announce Type: new
Abstract: The detection of Alzheimer's disease (AD) is considered crucial, as timely intervention can improve patient outcomes. Electroencephalogram (EEG)-based diagnosis has been recognized as a non-invasive, acce…

cs.CL, cs.LG

NodeSynth: Socially Aligned Synthetic Data for AI Evaluation

/ May 15, 2026

arXiv:2605.14381v1 Announce Type: new
Abstract: Recent advancements in generative AI facilitate large-scale synthetic data generation for model evaluation. However, without targeted approaches, these datasets often lack the sociotechnical nuance requi…

cs.CL, cs.LG, stat.ML

Knowing When to Quit: A Principled Framework for Dynamic Abstention in LLM Reasoning

/ May 15, 2026

arXiv:2604.18419v3 Announce Type: replace-cross
Abstract: LLMs utilizing chain-of-thought reasoning often waste substantial compute by producing long, incorrect responses. Abstention can mitigate this by withholding outputs unlikely to be correct. Whi…