Provide.ai - Page 188

Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL

/ May 5, 2026

arXiv:2604.28123v2 Announce Type: replace-cross
Abstract: The standard post-training recipe for large multimodal models (LMMs) applies supervised fine-tuning (SFT) on curated demonstrations followed by reinforcement learning with verifiable rewards (R…

cs.AI, cs.LG

Recurrent Deep Reinforcement Learning for Chemotherapy Control under Partial Observability

/ May 5, 2026

arXiv:2605.02552v1 Announce Type: cross
Abstract: Chemotherapy dose optimization can be formulated as a dynamic treatment regime, requiring sequential decisions under uncertainty that must balance tumor suppression against toxicity. However, most rein…

cs.AI, cs.LG

Federated Semi-Supervised Graph Neural Networks with Prototype-Guided Pseudo-Labeling for Privacy-Preserving Gestational Diabetes Mellitus Prediction

/ May 5, 2026

arXiv:2605.01810v1 Announce Type: cross
Abstract: Gestational Diabetes Mellitus (GDM) is a high-prevalence pregnancy complication that requires accurate early risk stratification to reduce maternal and fetal morbidity. However, real-world clinical dep…

cs.CV

SpecEdit: Training-Free Acceleration for Diffusion based Image Editing via Semantic Locking

/ May 5, 2026

arXiv:2605.02152v1 Announce Type: new
Abstract: Diffusion-based image editing offers strong semantic controllability, but remains computationally expensive due to iterative high-resolution denoising over all spatial tokens. Dynamic-resolution sampling…

cs.CV, cs.HC, cs.LG

Multimodal Ambivalence/Hesitancy Recognition in Videos for Personalized Digital Health Interventions

/ May 5, 2026

arXiv:2604.11730v3 Announce Type: replace-cross
Abstract: Using behavioural science, health interventions focus on behaviour change by providing a framework to help patients acquire and maintain healthy habits that improve medical outcomes. In-person …

cs.AI, cs.CV, cs.LG

AVA-Bench: Atomic Visual Ability Benchmark for Vision Foundation Models

/ May 5, 2026

arXiv:2506.09082v5 Announce Type: replace-cross
Abstract: The rise of vision foundation models (VFMs) calls for systematic evaluation. A common approach pairs VFMs with large language models (LLMs) as general-purpose heads, followed by evaluation on b…

cs.CV

Adapting Vision-Language Foundation Model for Next Generation Medical Ultrasound Image Analysis

/ May 5, 2026

arXiv:2506.08849v4 Announce Type: replace
Abstract: Vision-Language Foundation Models (VLFMs) exhibit remarkable generalization, yet their direct application to medical ultrasound is severely hindered by a profound modality gap. The unique acoustic ph…

cs.LG

Skipping the Zeros in Diffusion Models for Sparse Data Generation

/ May 5, 2026

arXiv:2605.01817v1 Announce Type: new
Abstract: Diffusion models (DMs) excel on dense continuous data, but are not designed for sparse continuous data. They do not model exact zeros that represent the deliberate absence of a signal. As a result, they …

cs.AI, cs.CV

Decision Boundary-aware Generation for Long-tailed Learning

/ May 5, 2026

arXiv:2605.01468v1 Announce Type: cross
Abstract: Long-tailed data bias decision boundaries toward head classes and degrade tail-class accuracy. Diffusion-based generative augmentation addresses this problem by generating additional data, while head-to-…

Artificial Intelligence, Generative AI

Relying on LLMs is nearly impossible when AI vendors keep changing things

/ May 4, 2026

Over the years, enterprise IT execs have gotten frighteningly comfortable having little control or visibility over mission-critical apps, from SaaS to cloud and even cybersecurity. But generative AI (genAI) and agentic systems ar…