LinguDistill: Recovering Linguistic Ability in Vision-Language Models via Selective Cross-Modal Distillation

April 28, 2026

arXiv:2604.00829v3 Announce Type: replace-cross
Abstract: Adapting pretrained language models (LMs) into vision-language models (VLMs) can degrade their native linguistic capability due to representation shift and cross-modal interference introduced d…

cs.AI, cs.CL

Evaluating Language Models’ Evaluations of Games

April 28, 2026

arXiv:2510.10930v2 Announce Type: replace
Abstract: Reasoning is not just about solving problems — it is also about evaluating which problems are worth solving at all. Evaluations of artificial intelligence (AI) systems primarily focused on problem s…

cs.CL

Can LLMs Act as Historians? Evaluating Historical Research Capabilities of LLMs via the Chinese Imperial Examination

April 28, 2026

arXiv:2604.24690v1 Announce Type: new
Abstract: While Large Language Models (LLMs) have increasingly assisted in historical tasks such as text processing, their capacity for professional-level historical reasoning remains underexplored. Existing bench…

cs.CL

AI use in American newspapers is widespread, uneven, and rarely disclosed

April 28, 2026

arXiv:2510.18774v4 Announce Type: replace
Abstract: AI is rapidly transforming journalism, but the extent of its use in published newspaper articles remains unclear. We address this gap by auditing a large-scale dataset of 186K articles from online ed…

cs.AI, cs.CL

Green Shielding: A User-Centric Approach Towards Trustworthy AI

April 28, 2026

arXiv:2604.24700v1 Announce Type: new
Abstract: Large language models (LLMs) are increasingly deployed, yet their outputs can be highly sensitive to routine, non-adversarial variation in how users phrase queries, a gap not well addressed by existing r…

cs.AI, cs.CL

Agentic clinical reasoning over longitudinal myeloma records: a retrospective evaluation against expert consensus

April 28, 2026

arXiv:2604.24473v1 Announce Type: cross
Abstract: Multiple myeloma is managed through sequential lines of therapy over years to decades, with each decision depending on cumulative disease history distributed across dozens to hundreds of heterogeneous …

cs.AI, cs.CL, cs.SD

Hearing to Translate: The Effectiveness of Speech Modality Integration into LLMs

April 28, 2026

arXiv:2512.16378v4 Announce Type: replace
Abstract: As Large Language Models (LLMs) expand beyond text, integrating speech as a native modality has given rise to SpeechLLMs, which directly process spoken language and enable speech-to-text translation …

cs.CL, cs.LG

Long-Context Aware Upcycling: A New Frontier for Hybrid LLM Scaling

April 28, 2026

arXiv:2604.24715v1 Announce Type: new
Abstract: Hybrid sequence models that combine efficient Transformer components with linear sequence modeling blocks are a promising alternative to pure Transformers, but most are still pretrained from scratch and …

cs.AI, cs.CL, cs.LG

Quantifying and Improving the Robustness of Retrieval-Augmented Language Models Against Spurious Features in Grounding Data

April 28, 2026

arXiv:2503.05587v3 Announce Type: replace
Abstract: Robustness has become a critical attribute for the deployment of RAG systems in real-world applications. Existing research focuses on robustness to explicit noise (e.g., document semantics) but overl…

cs.AI, cs.CL, cs.CV

Agri-CPJ: A Training-Free Explainable Framework for Agricultural Pest Diagnosis Using Caption-Prompt-Judge and LLM-as-a-Judge

April 28, 2026

arXiv:2604.23701v1 Announce Type: new
Abstract: Crop disease diagnosis from field photographs faces two recurring problems: models that score well on benchmarks frequently hallucinate species names, and when predictions are correct, the reasoning behi…