- Provide.ai - Page 12

How Does the Lagrangian Guide Safe Reinforcement Learning through Diffusion Models?

/ May 7, 2026

arXiv:2602.02924v2 Announce Type: replace
Abstract: Diffusion policy sampling enables reinforcement learning (RL) to represent multimodal action distributions beyond suboptimal unimodal Gaussian policies. However, existing diffusion-based RL methods p…

cs.CV

Deep Reprogramming Distillation for Medical Foundation Models

/ May 7, 2026

arXiv:2605.04447v1 Announce Type: new
Abstract: Medical foundation models pre-trained on large-scale datasets have shown powerful versatile performance. However, when adapting medical foundation models for specific medical scenarios, it remains the in…

cs.AI, cs.CV, cs.LG

What Matters in Practical Learned Image Compression

/ May 7, 2026

arXiv:2605.05148v1 Announce Type: new
Abstract: One of the major differentiators unlocked by learned codecs relative to their hard-coded traditional counterparts is their ability to be optimized directly to appeal to the human visual system. Despite t…

cs.CL, cs.LG

DFPO: Scaling Value Modeling via Distributional Flow towards Robust and Generalizable LLM Post-Training

/ May 7, 2026

arXiv:2602.05890v2 Announce Type: replace
Abstract: Training reinforcement learning (RL) systems in real-world environments remains challenging due to noisy supervision and poor out-of-domain (OOD) generalization, especially in LLM post-training. Rece…

cs.AI, cs.CV

Aes3D: Aesthetic Assessment in 3D Gaussian Splatting

/ May 7, 2026

arXiv:2605.05155v1 Announce Type: new
Abstract: As 3D Gaussian Splatting (3DGS) gains attention in immersive media and digital content creation, assessing the aesthetics of 3D scenes becomes important in helping creators build more visually compelling…

cs.CV, eess.IV

Fully Guided Neural Schr\”odinger bridge for Brain MR image synthesis

/ May 7, 2026

arXiv:2501.14171v3 Announce Type: replace-cross
Abstract: Multi-modal brain MRI provides essential complementary information for clinical diagnosis. However, acquiring all modalities in practice is often constrained by time and cost. To address this, …

cs.LG

CuBridge: An LLM-Based Framework for Understanding and Reconstructing High-Performance Attention Kernels

/ May 7, 2026

arXiv:2605.05023v1 Announce Type: new
Abstract: Efficient CUDA implementations of attention mechanisms are critical to modern deep learning systems, yet supporting diverse and evolving attention variants remains challenging. Existing frameworks and co…

cs.AI, cs.CV

StableI2I: Spotting Unintended Changes in Image-to-Image Transition

/ May 7, 2026

arXiv:2605.04453v1 Announce Type: new
Abstract: In most real-world image-to-image (I2I) scenarios, existing evaluations primarily focus on instruction following and the perceptual quality or aesthetics of the generated images. However, they largely fa…

cs.AI, cs.CR, cs.LG

Undetectable Backdoors in Model Parameters: Hiding Sparse Secrets in High Dimensions

/ May 7, 2026

arXiv:2605.04209v1 Announce Type: cross
Abstract: We present Sparse Backdoor, a supply-chain attack that plants a \emph{provably undetectable} backdoor in pre-trained image classifiers, including convolutional networks and Vision Transformers. The att…

cs.CV

PhysForge: Generating Physics-Grounded 3D Assets for Interactive Virtual World

/ May 7, 2026

arXiv:2605.05163v1 Announce Type: new
Abstract: Synthesizing physics-grounded 3D assets is a critical bottleneck for interactive virtual worlds and embodied AI. Existing methods predominantly focus on static geometry, overlooking the functional proper…