- Provide.ai - Page 482

Language Models Can Explain Visual Features via Steering

/ March 26, 2026

arXiv:2603.22593v2 Announce Type: replace-cross
Abstract: Sparse Autoencoders uncover thousands of features in vision models, yet explaining these features without requiring human intervention remains an open challenge. While previous work has propose…

cs.CV

CoRe: Joint Optimization with Contrastive Learning for Medical Image Registration

/ March 26, 2026

arXiv:2603.23694v1 Announce Type: new
Abstract: Medical image registration is a fundamental task in medical image analysis, enabling the alignment of images from different modalities or time points. However, intensity inconsistencies and nonlinear tis…

cs.AI

Relationship-Aware Safety Unlearning for Multimodal LLMs

/ March 26, 2026

arXiv:2603.14185v3 Announce Type: replace
Abstract: Generative multimodal models can exhibit safety failures that are inherently relational: two benign concepts can become unsafe when linked by a specific action or relation (e.g., child-drinking-wine)…

cs.CV

GaINeR: Geometry-Aware Implicit Network Representation

/ March 26, 2026

arXiv:2511.20924v2 Announce Type: replace
Abstract: Implicit Neural Representations (INRs) are widely used for modeling continuous 2D images, enabling high-fidelity reconstruction, super-resolution, and compression. Architectures such as SIREN, WIRE, …

cs.CV, cs.RO

TAG: Target-Agnostic Guidance for Stable Object-Centric Inference in Vision-Language-Action Models

/ March 26, 2026

arXiv:2603.24584v1 Announce Type: new
Abstract: Vision–Language–Action (VLA) policies have shown strong progress in mapping language instructions and visual observations to robotic actions, yet their reliability degrades in cluttered scenes with dis…

cs.CV

Distribution Matching Distillation Meets Reinforcement Learning

/ March 26, 2026

arXiv:2511.13649v4 Announce Type: replace
Abstract: Distribution Matching Distillation (DMD) facilitates efficient inference by distilling multi-step diffusion models into few-step variants. Concurrently, Reinforcement Learning (RL) has emerged as a v…

cs.CV

3D-LLDM: Label-Guided 3D Latent Diffusion Model for Improving High-Resolution Synthetic MR Imaging in Hepatic Structure Segmentation

/ March 26, 2026

arXiv:2603.23845v1 Announce Type: new
Abstract: Deep learning and generative models are advancing rapidly, with synthetic data increasingly being integrated into training pipelines for downstream analysis tasks. However, in medical imaging, their adop…

cs.CV

SERA-H: Beyond Native Sentinel Spatial Limits for High-Resolution Canopy Height Mapping

/ March 26, 2026

arXiv:2512.18128v3 Announce Type: replace
Abstract: High-resolution mapping of canopy height is essential for forest management and biodiversity monitoring. Although recent studies have led to the advent of deep learning methods using satellite imager…

cs.CV

MLE-UVAD: Minimal Latent Entropy Autoencoder for Fully Unsupervised Video Anomaly Detection

/ March 26, 2026

arXiv:2603.23868v1 Announce Type: new
Abstract: In this paper, we address the challenging problem of single-scene, fully unsupervised video anomaly detection (VAD), where raw videos containing both normal and abnormal events are used directly for trai…

cs.CV

Dehallu3D: Hallucination-Mitigated 3D Generation from Single Image via Cyclic View Consistency Refinement

/ March 26, 2026

arXiv:2603.01601v2 Announce Type: replace
Abstract: Large 3D reconstruction models have revolutionized the 3D content generation field, enabling broad applications in virtual reality and gaming. Just like other large models, large 3D reconstruction mo…