Provide.ai

GISTBench: Evaluating LLM User Understanding via Evidence-Based Interest Verification

April 1, 2026

arXiv:2603.29112v1 Announce Type: cross
Abstract: We introduce GISTBench, a benchmark for evaluating Large Language Models’ (LLMs) ability to understand users from their interaction histories in recommendation systems. Unlike traditional RecSys benchm…

cs.CV, cs.LG

Interpretable and Steerable Concept Bottleneck Sparse Autoencoders

April 1, 2026

arXiv:2512.10805v2 Announce Type: replace-cross
Abstract: Sparse autoencoders (SAEs) promise a unified approach for mechanistic interpretability, concept discovery, and model steering in LLMs and LVLMs. However, realizing this potential requires learn…

cs.CV

EarthEmbeddingExplorer: A Web Application for Cross-Modal Retrieval of Global Satellite Images

April 1, 2026

arXiv:2603.29441v1 Announce Type: new
Abstract: While the Earth observation community has witnessed a surge in high-impact foundation models and global Earth embedding datasets, a significant barrier remains in translating these academic assets into f…

cs.AI, cs.CV, cs.LG

GenOL: Generating Diverse Examples for Name-only Online Learning

April 1, 2026

arXiv:2403.10853v4 Announce Type: replace-cross
Abstract: Online learning methods often rely on supervised data. However, under data distribution shifts, such as in continual learning (CL), where continuously arriving online data streams incorporate n…

cs.AI, cs.CL, cs.HC, cs.IR, cs.MA

MA-SAPO: Multi-Agent Reasoning for Score-Aware Prompt Optimization

April 1, 2026

arXiv:2510.16635v2 Announce Type: replace-cross
Abstract: Prompt optimization has become a practical way to improve the performance of Large Language Models (LLMs) without retraining. However, most existing frameworks treat evaluation as a black box, …

cs.AI, cs.CV

NeoNet: An End-to-End 3D MRI-Based Deep Learning Framework for Non-Invasive Prediction of Perineural Invasion via Generation-Driven Classification

April 1, 2026

arXiv:2603.29449v1 Announce Type: new
Abstract: Minimizing invasive diagnostic procedures to reduce the risk of patient injury and infection is a central goal in medical imaging. And yet, noninvasive diagnosis of perineural invasion (PNI), a critical …

cs.DC, cs.LG

CRAFT: Cost-aware Expert Replica Allocation with Fine-Grained Layerwise Estimations

April 1, 2026

arXiv:2603.28768v1 Announce Type: cross
Abstract: Mixture-of-Experts (MoE) has recently emerged as the mainstream architecture for efficiently scaling large language models while maintaining near-constant computational cost. Expert parallelism distrib…

cs.AI, cs.CY, econ.GN, q-fin.EC

Economics of Human and AI Collaboration: When is Partial Automation More Attractive than Full Automation?

April 1, 2026

arXiv:2603.29121v1 Announce Type: cross
Abstract: This paper develops a unified framework for evaluating the optimal degree of task automation. Moving beyond binary automate-or-not assessments, we model automation intensity as a continuous choice in w…

cs.CV

Square Superpixel Generation and Representation Learning via Granular Ball Computing

April 1, 2026

arXiv:2603.29460v1 Announce Type: new
Abstract: Superpixels provide a compact region-based representation that preserves object boundaries and local structures, and have therefore been widely used in a variety of vision tasks to reduce computational c…

cs.AI

TeamMedAgents: Pareto-Efficient Multi-Agent Medical Reasoning Through Teamwork Theory

April 1, 2026

arXiv:2508.08115v3 Announce Type: replace
Abstract: Complex medical reasoning has historically required frontier language models to achieve clinically-acceptable accuracy, creating computational barriers that limit deployment in resource-constrained c…