cs.AI

Lightweight LLM Agent Memory with Small Language Models

arXiv:2604.07798v3 Announce Type: replace
Abstract: Although LLM agents can leverage tools for complex tasks, they still need memory to maintain cross-turn consistency and accumulate reusable information in long-horizon interactions. However, retrieva…

cs.LG

Super Apriel: One Checkpoint, Many Speeds

arXiv:2604.19877v1 Announce Type: new
Abstract: We release Super Apriel, a 15B-parameter supernet in which every decoder layer provides four trained mixer choices — Full Attention (FA), Sliding Window Attention (SWA), Kimi Delta Attention (KDA), and …
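The abstract describes a supernet in which each decoder layer carries several trained mixer options and a per-layer choice selects one at inference time, trading speed for quality from a single checkpoint. A minimal sketch of that selection idea, with purely illustrative toy mixers and names (none of this is Super Apriel's actual API):

```python
# Hypothetical sketch of the "one checkpoint, many speeds" idea: each
# decoder layer holds multiple trained mixer options, and a per-layer
# choice vector picks one at inference time. The mixers below are toy
# stand-ins, not real attention implementations.

def full_attention(x):
    return [v * 2 for v in x]   # stand-in for a Full Attention (FA) mixer

def sliding_window(x):
    return [v + 1 for v in x]   # stand-in for Sliding Window Attention (SWA)

MIXERS = {"FA": full_attention, "SWA": sliding_window}

def run_supernet(x, layer_choices):
    """Apply one selected mixer per layer; different choice vectors
    give different speed/quality trade-offs without retraining."""
    for choice in layer_choices:
        x = MIXERS[choice](x)
    return x

fast_path = run_supernet([1, 2], ["SWA", "SWA"])  # cheaper configuration
slow_path = run_supernet([1, 2], ["FA", "FA"])    # heavier configuration
```

The point of the sketch is only the dispatch pattern: the checkpoint stores weights for every mixer option, and deployment fixes one choice per layer.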

cs.AI, cs.CY

Fairness Testing of Large Language Models in Role-Playing

arXiv:2411.00585v2 Announce Type: replace-cross
Abstract: Large Language Models (LLMs) have become foundational in modern language-driven software applications, profoundly influencing daily life. A critical technique in leveraging their potential is r…
