Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?
arXiv:2603.24472v1 Announce Type: cross
Abstract: Self-distillation has emerged as an effective post-training paradigm for LLMs, often improving performance while shortening reasoning traces. However, in mathematical reasoning, we find that it can red…
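The abstract names self-distillation as a post-training paradigm but the feed truncates before any recipe details. As a point of reference, here is a minimal sketch of one common variant (rejection-sampling-style self-distillation: sample reasoning traces from the current model, keep only traces with a correct final answer, fine-tune on the kept traces). Everything below is an illustrative assumption, not the paper's method: `Example`, `extract_answer`, `self_distill_round`, and the `sample`/`finetune` callables are hypothetical names.

```python
"""Hypothetical sketch of one self-distillation round for math reasoning.
Assumes a rejection-sampling setup; the paper's actual recipe is not shown
in the truncated abstract. All names here are illustrative placeholders."""
from dataclasses import dataclass
from typing import Callable


@dataclass
class Example:
    problem: str
    gold_answer: str  # gold final answer, used to filter sampled traces


def extract_answer(trace: str) -> str:
    # Assumption: traces end with a "#### <answer>" marker (GSM8K-style).
    return trace.rsplit("####", 1)[-1].strip() if "####" in trace else ""


def self_distill_round(
    sample: Callable[[str], list[str]],                 # problem -> k sampled traces
    finetune: Callable[[list[tuple[str, str]]], None],  # SFT on (problem, trace) pairs
    dataset: list[Example],
) -> list[tuple[str, str]]:
    """One round: keep the shortest correct trace per problem, then
    fine-tune the model on that self-generated, filtered corpus."""
    corpus: list[tuple[str, str]] = []
    for ex in dataset:
        correct = [t for t in sample(ex.problem)
                   if extract_answer(t) == ex.gold_answer]
        if correct:
            # Preferring short correct traces is one way self-distillation
            # shortens reasoning; whether this is where degradation enters
            # is exactly the kind of question the paper's title raises.
            corpus.append((ex.problem, min(correct, key=len)))
    finetune(corpus)
    return corpus
```

In practice `sample` would wrap temperature-based decoding from the current checkpoint and `finetune` a standard supervised fine-tuning step; they are left as callables here so the filtering logic, the part most relevant to the abstract's claim, stays self-contained.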