- Provide.ai - Page 67

SAGE: Scalable Automated Robustness Augmentation for LLM Knowledge Evaluation

/ May 13, 2026

arXiv:2605.12022v1 Announce Type: new
Abstract: Large Language Models (LLMs) achieve strong performance on standard knowledge evaluation benchmarks, yet recent work shows that their knowledge capabilities remain brittle under question variants that te…

cs.LG

A Comparative Study of Federated Learning Aggregation Strategies under Homogeneous and Heterogeneous Data Distributions

/ May 13, 2026

arXiv:2605.11010v1 Announce Type: new
Abstract: Federated Learning has emerged as a transformative paradigm for collaborative machine learning across distributed environments. However, its performance is strongly influenced by the aggregation strategy…

cs.AI, cs.LG

Backbone-Equated Diffusion OOD via Sparse Internal Snapshots

/ May 13, 2026

arXiv:2605.11014v1 Announce Type: new
Abstract: Fair comparison between diffusion-based OOD detectors is challenging, as conclusions can vary with backbone choice, corruption parameterization, and test-time budget. We address this issue through a Mutu…

cs.AI, cs.CL

Coevolutionary Continuous Discrete Diffusion: Make Your Diffusion Language Model a Latent Reasoner

/ May 13, 2026

arXiv:2510.03206v2 Announce Type: replace-cross
Abstract: Diffusion language models, especially masked discrete diffusion models, have achieved great success recently. While there are some theoretical and primary empirical results showing the advantag…

cs.AI, cs.LG

Learning, Fast and Slow: Towards LLMs That Adapt Continually

/ May 13, 2026

arXiv:2605.12484v1 Announce Type: new
Abstract: Large language models (LLMs) are trained for downstream tasks by updating their parameters (e.g., via RL). However, updating parameters forces them to absorb task-specific information, which can result i…

cs.CV

Principled Design of Diffusion-based Optimizers for Inverse Problems

/ May 13, 2026

arXiv:2605.11506v1 Announce Type: new
Abstract: Score-based diffusion models achieve state-of-the-art performance for inverse problems, but their practical deployment is hindered by long inference times and cumbersome hyperparameter tuning. While pret…

cs.CV

LiBrA-Net: Lie-Algebraic Bilateral Affine Fields for Real-Time 4K Video Dehazing

/ May 13, 2026

arXiv:2605.11508v1 Announce Type: new
Abstract: Currently, there is a gap in the field of ultra-high-definition (UHD) video dehazing due to the lack of a benchmark for evaluation. Furthermore, existing video dehazing methods cannot run on consumer-gra…

cs.AI, cs.ET, cs.LG, quant-ph

Measuring Accuracy and Energy-to-Solution of Quantum Fine-Tuning of Foundational AI Models

/ May 13, 2026

arXiv:2605.02798v1 Announce Type: cross
Abstract: We present an experimental study of energy-to-solution (ETS) of hybrid quantum-classical applications, enabled by direct instrumentation of power consumption of a Forte Enterprise trapped-ion quantum p…

cs.CL, cs.CV

MobileEgo Anywhere: Open Infrastructure for long horizon egocentric data on commodity hardware

/ May 13, 2026

arXiv:2605.05945v2 Announce Type: replace-cross
Abstract: The recent advancement of Vision Language Action (VLA) models has driven a critical demand for large scale egocentric datasets. However, existing datasets are often limited by short episode dur…

cs.CL, cs.CY

Metaphor Is Not All Attention Needs

/ May 13, 2026

arXiv:2605.12128v1 Announce Type: new
Abstract: Large language models are increasingly deployed in safety-critical applications, where their ability to resist harmful instructions is essential. Although post-training aims to make models robust against…