- Provide.ai - Page 113

ELMoE-3D: Leveraging Intrinsic Elasticity of MoE for Hybrid-Bonding-Enabled Self-Speculative Decoding in On-Premises Serving

/ April 24, 2026

arXiv:2604.14626v2 Announce Type: replace-cross
Abstract: Mixture-of-Experts (MoE) models have become the dominant architecture for large-scale language models, yet on-premises serving remains fundamentally memory-bound as batching turns sparse per-to…

cs.RO

Hi-WM: Human-in-the-World-Model for Scalable Robot Post-Training

/ April 24, 2026

arXiv:2604.21741v1 Announce Type: new
Abstract: Post-training is essential for turning pretrained generalist robot policies into reliable task-specific controllers, but existing human-in-the-loop pipelines remain tied to physical execution: each corre…

cs.AI, cs.LG

Deep FinResearch Bench: Evaluating AI’s Ability to Conduct Professional Financial Investment Research

/ April 24, 2026

arXiv:2604.21006v1 Announce Type: new
Abstract: We introduce Deep FinResearch Bench, a practical and comprehensive evaluation framework for deep research (DR) agents in financial investment research. The benchmark assesses three dimensions of report q…

cs.AI

KompeteAI: Accelerated Autonomous Multi-Agent System for End-to-End Pipeline Generation for Machine Learning Problems

/ April 24, 2026

arXiv:2508.10177v3 Announce Type: replace
Abstract: Recent Large Language Model (LLM)-based AutoML systems demonstrate impressive capabilities but face significant limitations such as constrained exploration strategies and a severe execution bottlenec…

cs.AI, math.OC

Integrated packing, placement, scheduling, and routing of personalized production: a pharmaceutical Industry 4.0 use-case with a planar transport system

/ April 24, 2026

arXiv:2604.21029v1 Announce Type: cross
Abstract: The recent emergence of planar transport systems necessitates re-evaluation of Flexible Manufacturing Systems (FMS) to address the simultaneous scheduling of internal logistics and production operation…

cs.AI

AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts

/ April 24, 2026

arXiv:2601.11044v4 Announce Type: replace
Abstract: Large Language Models (LLMs) based autonomous agents demonstrate multifaceted capabilities to contribute substantially to economic production. However, existing benchmarks remain focused on single ag…

cs.AI, cs.LG

Wiring the ‘Why’: A Unified Taxonomy and Survey of Abductive Reasoning in LLMs

/ April 24, 2026

arXiv:2604.08016v2 Announce Type: replace
Abstract: Regardless of its foundational role in human discovery and sense-making, abductive reasoning–the inference of the most plausible explanation for an observation–has been relatively underexplored in …

cs.AI, cs.LG

mGRADE: Minimal Recurrent Gating Meets Delay Convolutions for Lightweight Sequence Modeling

/ April 24, 2026

arXiv:2507.01829v2 Announce Type: replace-cross
Abstract: Multi-timescale sequence modeling relies on capturing both local fast dynamics and global slow context; yet, maintaining these capabilities under the strict memory constraints common to edge de…

cs.AI, cs.LG

Retrofit: Continual Learning with Controlled Forgetting for Binary Security Detection and Analysis

/ April 24, 2026

arXiv:2511.11439v2 Announce Type: replace-cross
Abstract: Binary security has increasingly relied on deep learning to reason about malware behavior and program semantics. However, the performance often degrades as threat landscapes evolve and code rep…

cs.AI, cs.LG

CAP: Controllable Alignment Prompting for Unlearning in LLMs

/ April 24, 2026

arXiv:2604.21251v1 Announce Type: cross
Abstract: Large language models (LLMs) trained on unfiltered corpora inherently risk retaining sensitive information, necessitating selective knowledge unlearning for regulatory compliance and ethical safety. Ho…