- Provide.ai - Page 323

TAG: Target-Agnostic Guidance for Stable Object-Centric Inference in Vision-Language-Action Models

/ March 26, 2026

arXiv:2603.24584v1 Announce Type: new
Abstract: Vision–Language–Action (VLA) policies have shown strong progress in mapping language instructions and visual observations to robotic actions, yet their reliability degrades in cluttered scenes with dis…

cs.CV

Distribution Matching Distillation Meets Reinforcement Learning

/ March 26, 2026

arXiv:2511.13649v4 Announce Type: replace
Abstract: Distribution Matching Distillation (DMD) facilitates efficient inference by distilling multi-step diffusion models into few-step variants. Concurrently, Reinforcement Learning (RL) has emerged as a v…

cs.CV

3D-LLDM: Label-Guided 3D Latent Diffusion Model for Improving High-Resolution Synthetic MR Imaging in Hepatic Structure Segmentation

/ March 26, 2026

arXiv:2603.23845v1 Announce Type: new
Abstract: Deep learning and generative models are advancing rapidly, with synthetic data increasingly being integrated into training pipelines for downstream analysis tasks. However, in medical imaging, their adop…

cs.CV

SERA-H: Beyond Native Sentinel Spatial Limits for High-Resolution Canopy Height Mapping

/ March 26, 2026

arXiv:2512.18128v3 Announce Type: replace
Abstract: High-resolution mapping of canopy height is essential for forest management and biodiversity monitoring. Although recent studies have led to the advent of deep learning methods using satellite imager…

cs.CV

MLE-UVAD: Minimal Latent Entropy Autoencoder for Fully Unsupervised Video Anomaly Detection

/ March 26, 2026

arXiv:2603.23868v1 Announce Type: new
Abstract: In this paper, we address the challenging problem of single-scene, fully unsupervised video anomaly detection (VAD), where raw videos containing both normal and abnormal events are used directly for trai…

cs.CV

Dehallu3D: Hallucination-Mitigated 3D Generation from Single Image via Cyclic View Consistency Refinement

/ March 26, 2026

arXiv:2603.01601v2 Announce Type: replace
Abstract: Large 3D reconstruction models have revolutionized the 3D content generation field, enabling broad applications in virtual reality and gaming. Just like other large models, large 3D reconstruction mo…

cs.AI, cs.CV, cs.LG

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

/ March 26, 2026

arXiv:2603.24440v1 Announce Type: new
Abstract: Computer-use agents (CUAs) hold great promise for automating complex desktop workflows, yet progress toward general-purpose agents is bottlenecked by the scarcity of continuous, high-quality human demons…

cs.LG, physics.flu-dyn

Project and Generate: Divergence-Free Neural Operators for Incompressible Flows

/ March 26, 2026

arXiv:2603.24500v1 Announce Type: new
Abstract: Learning-based models for fluid dynamics often operate in unconstrained function spaces, leading to physically inadmissible, unstable simulations. While penalty-based methods offer soft regularization, t…

cs.CL

PINGALA: Prosody-Aware Decoding for Sanskrit Poetry Generation

/ March 26, 2026

arXiv:2603.24413v1 Announce Type: new
Abstract: Poetry generation in Sanskrit typically requires the verse to be semantically coherent and adhere to strict prosodic rules. In Sanskrit prosody, every line of a verse is typically a fixed length sequence…

stat.AP, stat.CO, stat.ME, stat.ML

Adaptive Gaussian Process Search for Simulation-Based Sample Size Estimation in Clinical Prediction Models: Validation of the pmsims R Package

/ March 26, 2026

arXiv:2603.23688v1 Announce Type: cross
Abstract: Background: Determining an adequate sample size is essential for developing reliable and generalisable clinical prediction models, yet practical guidance on selecting appropriate methods remains limite…