- Provide.ai - Page 20

STABLEVAL: Disagreement-Aware and Stable Evaluation of AI Systems

/ May 5, 2026

arXiv:2605.02122v1 Announce Type: cross
Abstract: Human evaluation remains the primary standard for assessing modern AI systems, yet annotator disagreement, bias, and variability make system rankings fragile under standard majority vote aggregation. M…

cs.LG

LUMINA: A Grid Foundation Model for Benchmarking AC Optimal Power Flow Surrogate Learning

/ May 5, 2026

arXiv:2605.02133v1 Announce Type: new
Abstract: AC optimal power flow (ACOPF) is foundational yet computationally expensive in power grid operations, driving learning-based surrogates for large-scale grid analysis. These surrogates, however, often fai…

cs.AI, cs.CV

TOC-SR: Task-Optimal Compact diffusion for Image Super Resolution

/ May 5, 2026

arXiv:2605.02767v1 Announce Type: cross
Abstract: Diffusion models have recently demonstrated strong performance for image restoration tasks, including super-resolution. However, their large model size and iterative sampling procedures make them compu…

cs.AI, cs.CV, cs.LG

Active Reasoning Vision-Language Models via Sequential Experimental Design

/ May 5, 2026

arXiv:2605.01345v1 Announce Type: cross
Abstract: Visual perception in modern Vision-Language Models (VLMs) is constrained by a fundamental perceptual bandwidth bottleneck: a broad field of view inevitably sacrifices the fine-grained details necessary…

cs.AI, cs.CV

Sparse Representation Learning for Vessels

/ May 5, 2026

arXiv:2605.01382v1 Announce Type: cross
Abstract: Analyzing human vasculature and vessel-like, tubular structures, such as airways, is crucial for disease diagnosis and treatment. Current methods often rely on small sub-regions or simplified tree-like…

cs.CV

VISTA: Video Interaction Spatio-Temporal Analysis Benchmark

/ May 5, 2026

arXiv:2605.01391v1 Announce Type: new
Abstract: Existing benchmarks for Vision-Language Models (VLMs) primarily evaluate spatio-temporal understanding on simple single-action videos, closed attribute sets and restricted entity types, failing to captur…

cs.CV

Act in Collusion: Distributed Multi-Target Backdoor Attacks in Federated Learning

/ May 5, 2026

arXiv:2411.03926v3 Announce Type: replace
Abstract: Federated learning (FL) is widely used in Internet-of-Things (IoT) systems, but its distributed training process also exposes it to backdoor attacks. Existing studies mainly consider single-target or…

cs.CV

Registration-Free Learnable Multi-View Capture of Faces in Dense Semantic Correspondence

/ May 5, 2026

arXiv:2605.01450v1 Announce Type: new
Abstract: Recent frameworks like ToFu and TEMPEH provide an automated alternative to classical registration pipelines by predicting 3D meshes in dense semantic correspondence directly from calibrated multi-view im…

cs.AI, cs.CV

SRGAN-CKAN: Expressive Super-Resolution with Nonlinear Functional Operators under Minimal Resources

/ May 5, 2026

arXiv:2605.01459v1 Announce Type: cross
Abstract: Single-Image Super-Resolution (SISR) aims to reconstruct a High-Resolution (HR) image from a Low-Resolution (LR) observation, a fundamentally ill-posed problem where high-frequency details are severely…

cs.AI, cs.CV, cs.LG

AVA-Bench: Atomic Visual Ability Benchmark for Vision Foundation Models

/ May 5, 2026

arXiv:2506.09082v5 Announce Type: replace-cross
Abstract: The rise of vision foundation models (VFMs) calls for systematic evaluation. A common approach pairs VFMs with large language models (LLMs) as general-purpose heads, followed by evaluation on b…