DASH-KV: Accelerating Long-Context LLM Inference via Asymmetric KV Cache Hashing
arXiv:2604.19351v2 Announce Type: replace
Abstract: The quadratic computational complexity of the standard attention mechanism constitutes a fundamental bottleneck for large language models in long-context inference. While existing KV cache compressio…
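For context on the quadratic cost the abstract refers to: standard attention materializes an n × n score matrix over a length-n context, so time and memory grow as O(n²), and the K/V tensors retained across decoding steps form the KV cache that compression methods target. A minimal single-head sketch in Python/NumPy, purely illustrative and not the paper's DASH-KV method:

```python
# Minimal single-head attention sketch (illustrative only; not DASH-KV).
# The (n, n) score matrix S is the quadratic term in context length n.
import numpy as np

def attention(Q, K, V):
    # Q, K, V: (n, d) arrays for a sequence of n tokens.
    d = Q.shape[-1]
    S = Q @ K.T / np.sqrt(d)              # (n, n) scores: O(n^2) work/memory
    P = np.exp(S - S.max(axis=-1, keepdims=True))
    P /= P.sum(axis=-1, keepdims=True)    # row-wise softmax
    return P @ V                          # (n, d) outputs

n, d = 1024, 64
rng = np.random.default_rng(0)
Q, K, V = (rng.standard_normal((n, d)) for _ in range(3))
out = attention(Q, K, V)
print(out.shape)  # (1024, 64); K and V are what a KV cache stores per layer
```

Doubling n quadruples the size of S, which is why long-context inference motivates KV cache compression schemes like the one the abstract describes.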