- Provide.ai - Page 436

OPRIDE: Offline Preference-based Reinforcement Learning via In-Dataset Exploration

/ April 6, 2026

arXiv:2604.02349v1 Announce Type: new
Abstract: Preference-based reinforcement learning (PbRL) can help avoid sophisticated reward designs and align better with human intentions, showing great promise in various real-world applications. However, obtai…

cs.LG, physics.comp-ph

Real-Time Surrogate Modeling for Personalized Blood Flow Prediction and Hemodynamic Analysis

/ April 6, 2026

arXiv:2604.03197v1 Announce Type: new
Abstract: Cardiovascular modeling has rapidly advanced over the past few decades due to the rising needs for health tracking and early detection of cardiovascular diseases. While 1-D arterial models offer an attra…

cs.CV, cs.RO, cs.SY, eess.SY

A Rapid Instrument Exchange System for Humanoid Robots in Minimally Invasive Surgery

/ April 6, 2026

arXiv:2604.02707v1 Announce Type: cross
Abstract: Humanoid robot technologies have demonstrated immense potential for minimally invasive surgery (MIS). Unlike dedicated multi-arm surgical platforms, the inherent dual-arm configuration of humanoid robo…

cs.AI, cs.CV, cs.LG

MOMO: Mars Orbital Model Foundation Model for Mars Orbital Applications

/ April 6, 2026

arXiv:2604.02719v1 Announce Type: cross
Abstract: We introduce MOMO, the first multi-sensor foundation model for Mars remote sensing. MOMO uses model merge to integrate representations learned independently from three key Martian sensors (HiRISE, CTX,…

cs.LG

When RL Meets Adaptive Speculative Training: A Unified Training-Serving System

/ April 6, 2026

arXiv:2602.06932v2 Announce Type: replace
Abstract: Speculative decoding can significantly accelerate LLM serving, yet most deployments today disentangle speculator training from serving, treating speculator training as a standalone offline modeling p…

cs.AI, cs.CL, cs.CY

Verbalizing LLMs’ assumptions to explain and control sycophancy

/ April 6, 2026

arXiv:2604.03058v1 Announce Type: cross
Abstract: LLMs can be socially sycophantic, affirming users when they ask questions like “am I in the wrong?” rather than providing genuine assessment. We hypothesize that this behavior arises from incorrect ass…

cs.LG

Hierarchical Planning with Latent World Models

/ April 6, 2026

arXiv:2604.03208v1 Announce Type: new
Abstract: Model predictive control (MPC) with learned world models has emerged as a promising paradigm for embodied control, particularly for its ability to generalize zero-shot when deployed in new environments. …

cs.AI, cs.LG

Equivariant Evidential Deep Learning for Interatomic Potentials

/ April 6, 2026

arXiv:2602.10419v2 Announce Type: replace
Abstract: Uncertainty quantification (UQ) is critical for assessing the reliability of machine learning interatomic potentials (MLIPs) in molecular dynamics (MD) simulations, identifying extrapolation regimes …

cs.CV

Parser-Oriented Structural Refinement for a Stable Layout Interface in Document Parsing

/ April 6, 2026

arXiv:2604.02692v1 Announce Type: new
Abstract: Accurate document parsing requires both robust content recognition and a stable parser interface. In explicit Document Layout Analysis (DLA) pipelines, downstream parsers do not consume the full detector…

cs.AI, cs.CV

DocShield: Towards AI Document Safety via Evidence-Grounded Agentic Reasoning

/ April 6, 2026

arXiv:2604.02694v1 Announce Type: new
Abstract: The rapid progress of generative AI has enabled increasingly realistic text-centric image forgeries, posing major challenges to document safety. Existing forensic methods mainly rely on visual cues and l…