- Provide.ai - Page 68

Rethinking Network Topologies for Cost-Effective Mixture-of-Experts LLM Serving

/ May 5, 2026

arXiv:2605.00254v1 Announce Type: cross
Abstract: Mixture-of-experts (MoE) architectures have turned LLM serving into a cluster-scale workload in which communication consumes a considerable portion of LLM serving runtime. This has prompted industry to…

cs.CV

From Where Things Are to What They Are For: Benchmarking Spatial-Functional Intelligence in Multimodal LLMs

/ May 5, 2026

arXiv:2605.02130v1 Announce Type: new
Abstract: Human-level agentic intelligence extends beyond low-level geometric perception, evolving from recognizing where things are to understanding what they are for. While existing benchmarks effectively evalua…

cs.DL, cs.LG

ARA: Agentic Reproducibility Assessment For Scalable Support Of Scientific Peer-Review

/ May 5, 2026

arXiv:2605.02651v1 Announce Type: cross
Abstract: Scientific peer review increasingly struggles to assess reproducibility at the scale and complexity of modern research output. Evaluating reproducibility requires reconstructing experimental dependenci…

cs.AI, cs.RO

VILAS: A VLA-Integrated Low-cost Architecture with Soft Grasping for Robotic Manipulation

/ May 5, 2026

arXiv:2605.02037v1 Announce Type: cross
Abstract: We present VILAS, a fully low-cost, modular robotic manipulation platform designed to support end-to-end vision-language-action (VLA) policy learning and deployment on accessible hardware. The system i…

cs.CV, cs.RO

VoxAfford: Multi-Scale Voxel-Token Fusion for Open-Vocabulary 3D Affordance Detection

/ May 5, 2026

arXiv:2605.01365v1 Announce Type: cross
Abstract: Open-vocabulary 3D affordance detection requires localizing interaction regions on point clouds given novel affordance descriptions. Recent methods extend multimodal large language models (MLLMs) with …

cs.AI, cs.LG

Recurrent Deep Reinforcement Learning for Chemotherapy Control under Partial Observability

/ May 5, 2026

arXiv:2605.02552v1 Announce Type: cross
Abstract: Chemotherapy dose optimization can be formulated as a dynamic treatment regime, requiring sequential decisions under uncertainty that must balance tumor suppression against toxicity. However, most rein…

cs.AI, cs.LG

Federated Semi-Supervised Graph Neural Networks with Prototype-Guided Pseudo-Labeling for Privacy-Preserving Gestational Diabetes Mellitus Prediction

/ May 5, 2026

arXiv:2605.01810v1 Announce Type: cross
Abstract: Gestational Diabetes Mellitus (GDM) is a high-prevalence pregnancy complication that requires accurate early risk stratification to reduce maternal and fetal morbidity. However, real-world clinical dep…

cs.CV

SpecEdit: Training-Free Acceleration for Diffusion based Image Editing via Semantic Locking

/ May 5, 2026

arXiv:2605.02152v1 Announce Type: new
Abstract: Diffusion-based image editing offers strong semantic controllability, but remains computationally expensive due to iterative high-resolution denoising over all spatial tokens. Dynamic-resolution sampling…

cs.CV, cs.HC, cs.LG

Multimodal Ambivalence/Hesitancy Recognition in Videos for Personalized Digital Health Interventions

/ May 5, 2026

arXiv:2604.11730v3 Announce Type: replace-cross
Abstract: Using behavioural science, health interventions focus on behaviour change by providing a framework to help patients acquire and maintain healthy habits that improve medical outcomes. In-person …

cs.CV

Adapting Vision-Language Foundation Model for Next Generation Medical Ultrasound Image Analysis

/ May 5, 2026

arXiv:2506.08849v4 Announce Type: replace
Abstract: Vision-Language Foundation Models (VLFMs) exhibit remarkable generalization, yet their direct application to medical ultrasound is severely hindered by a profound modality gap. The unique acoustic ph…