- Provide.ai - Page 296

A Survey of Scaling in Large Language Model Reasoning

/ April 23, 2026

arXiv:2504.02181v2 Announce Type: replace
Abstract: The rapid advancements in large Language models (LLMs) have significantly enhanced their reasoning capabilities, driven by various strategies such as multi-agent collaboration. However, unlike the we…

cs.AI, cs.CL

Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training

/ April 23, 2026

arXiv:2508.00414v3 Announce Type: replace
Abstract: General AI Agents are increasingly recognized as foundational frameworks for the next generation of artificial intelligence, enabling complex reasoning, web interaction, coding, and autonomous resear…

cs.AI, cs.CL

AstaBench: Rigorous Benchmarking of AI Agents with a Scientific Research Suite

/ April 23, 2026

arXiv:2510.21652v2 Announce Type: replace
Abstract: AI agents hold the potential to revolutionize scientific productivity by automating literature reviews, replicating experiments, analyzing data, and even proposing new directions of inquiry; indeed, …

cs.CV, cs.LG

Learn2Synth: Learning Optimal Data Synthesis Using Hypergradients for Brain Image Segmentation

/ April 23, 2026

arXiv:2411.16719v4 Announce Type: replace-cross
Abstract: Domain randomization through synthesis is a powerful strategy to train networks that are unbiased with respect to the domain of the input images. Randomization allows networks to see a virtuall…

cs.LG

KANMixer: a minimal KAN-centered mixer for long-term time series forecasting

/ April 23, 2026

arXiv:2508.01575v2 Announce Type: replace
Abstract: Long-term time series forecasting (LTSF) underpins critical applications from energy management to weather prediction, yet achieving reliable multi-step-ahead accuracy remains challenging. Existing L…

cs.LG, cs.NE, nlin.AO

An explicit operator explains end-to-end computation in the modern neural networks used for sequence and language modeling

/ April 23, 2026

arXiv:2604.20595v1 Announce Type: cross
Abstract: We establish a mathematical correspondence between state space models, a state-of-the-art architecture for capturing long-range dependencies in data, and an exactly solvable nonlinear oscillator networ…

cs.AI, cs.CL, cs.IR, cs.LG

DR-Venus: Towards Frontier Edge-Scale Deep Research Agents with Only 10K Open Data

/ April 23, 2026

arXiv:2604.19859v1 Announce Type: new
Abstract: Edge-scale deep research agents based on small language models are attractive for real-world deployment due to their advantages in cost, latency, and privacy. In this work, we study how to train a strong…

cs.LG

Super Apriel: One Checkpoint, Many Speeds

/ April 23, 2026

arXiv:2604.19877v1 Announce Type: new
Abstract: We release Super Apriel, a 15B-parameter supernet in which every decoder layer provides four trained mixer choices — Full Attention (FA), Sliding Window Attention (SWA), Kimi Delta Attention (KDA), and …

cs.AI, cs.CY, cs.LG

A Multi-Plant Machine Learning Framework for Emission Prediction, Forecasting, and Control in Cement Manufacturing

/ April 23, 2026

arXiv:2604.19903v1 Announce Type: new
Abstract: Cement production is among the largest contributors to industrial air pollution, emitting ~3 Mt NOx/year. The industry-standard mitigation approach, selective non-catalytic reduction (SNCR), exhibits low…

cs.AI, cs.CL, cs.HC, cs.LG, cs.RO

MOMO: A framework for seamless physical, verbal, and graphical robot skill learning and adaptation

/ April 23, 2026

arXiv:2604.20468v1 Announce Type: cross
Abstract: Industrial robot applications require increasingly flexible systems that non-expert users can easily adapt for varying tasks and environments. However, different adaptations benefit from different inte…