- Provide.ai - Page 423

LVLMs and Humans Ground Differently in Referential Communication

/ April 21, 2026

arXiv:2601.19792v3 Announce Type: replace
Abstract: For generative AI agents to partner effectively with human users, the ability to accurately predict human intent is critical. But this ability to collaborate remains limited by a critical deficit: an…

cs.CL

MoCo: A One-Stop Shop for Model Collaboration Research

/ April 21, 2026

arXiv:2601.21257v2 Announce Type: replace
Abstract: Advancing beyond single monolithic language models (LMs), recent research increasingly recognizes the importance of model collaboration, where multiple LMs collaborate, compose, and complement each o…

cs.LG, cs.PL, quant-ph

MerLin: A Discovery Engine for Photonic and Hybrid Quantum Machine Learning

/ April 21, 2026

arXiv:2602.11092v2 Announce Type: replace
Abstract: Identifying where quantum models may offer practical benefits in near term quantum machine learning (QML) requires moving beyond isolated algorithmic proposals toward systematic and empirical explora…

cs.AI, cs.LG

Bounded Ratio Reinforcement Learning

/ April 21, 2026

arXiv:2604.18578v1 Announce Type: new
Abstract: Proximal Policy Optimization (PPO) has become the predominant algorithm for on-policy reinforcement learning due to its scalability and empirical robustness across domains. However, there is a significan…

cs.CV

WildDet3D: Scaling Promptable 3D Detection in the Wild

/ April 21, 2026

arXiv:2604.08626v2 Announce Type: replace
Abstract: Understanding objects in 3D from a single image is a cornerstone of spatial intelligence. A key step toward this goal is monocular 3D object detection–recovering the extent, location, and orientatio…

cs.CV

The First Challenge on Mobile Real-World Image Super-Resolution at NTIRE 2026: Benchmark Results and Method Overview

/ April 21, 2026

arXiv:2604.17306v1 Announce Type: new
Abstract: This paper provides a review of the NTIRE 2026 challenge on mobile real-world image super-resolution, highlighting the proposed solutions and the resulting outcomes. The challenge aims to recover high-re…

cs.CL

BenchMarker: An Education-Inspired Toolkit for Highlighting Flaws in Multiple-Choice Benchmarks

/ April 21, 2026

arXiv:2602.06221v2 Announce Type: replace
Abstract: Multiple-choice question answering (MCQA) is standard in NLP, but benchmarks lack rigorous quality control. We present BenchMarker, an education-inspired toolkit using LLM judges to flag three common…

cs.CV, cs.LG

Vision Language Models are Biased

/ April 21, 2026

arXiv:2505.23941v4 Announce Type: replace
Abstract: Large language models (LLMs) memorize a vast amount of prior knowledge from the Internet that helps them on downstream tasks but also may notoriously sway their outputs towards wrong or biased answer…

cs.AI, cs.CL, cs.DB

PersonalHomeBench: Evaluating Agents in Personalized Smart Homes

/ April 21, 2026

arXiv:2604.16813v1 Announce Type: cross
Abstract: Agentic AI systems are rapidly advancing toward real-world applications, yet their readiness in complex and personalized environments remains insufficiently characterized. To address this gap, we intro…

cs.CV

NTIRE 2026 Challenge on Single Image Reflection Removal in the Wild: Datasets, Results, and Methods

/ April 21, 2026

arXiv:2604.10321v2 Announce Type: replace
Abstract: In this paper, we review the NTIRE 2026 challenge on single-image reflection removal (SIRR) in the wild. SIRR is a fundamental task in image restoration. Despite progress in academic research, most m…