Provide.ai

DORA: A Scalable Asynchronous Reinforcement Learning System for Language Model Training

/ April 30, 2026

arXiv:2604.26256v1 Announce Type: new
Abstract: Reinforcement learning (RL) has become a critical paradigm for LLM post-training, yet the rollout phase — accounting for 50–80% of total step time — is bottlenecked by skewed generation: long-tailed t…

cs.AI, cs.CL

Lunguage: A Benchmark for Structured and Sequential Chest X-ray Interpretation

/ April 30, 2026

arXiv:2505.21190v2 Announce Type: replace
Abstract: Radiology reports convey detailed clinical observations and capture diagnostic reasoning that evolves over time. However, existing evaluation methods are limited to single-report settings and rely on…

Artificial Intelligence, Government, Laws and Regulations

EU lawmakers fail to agree on watered-down AI Act, talks pushed to May

/ April 29, 2026

EU member states and the European Parliament failed to agree on changes that would have softened the bloc’s AI Act and pushed back its toughest enforcement deadlines.

The talks ran for about 12 hours on Tuesday and ended witho…

cs.AI, cs.CL, cs.RO

Limited Linguistic Diversity in Embodied AI Datasets

/ April 29, 2026

arXiv:2601.03136v2 Announce Type: replace
Abstract: Language plays a critical role in Vision-Language-Action (VLA) models, yet the linguistic characteristics of the datasets used to train and evaluate these systems remain poorly documented. In this wo…

cs.AI, cs.CL, cs.LG

Recursive Multi-Agent Systems

/ April 29, 2026

arXiv:2604.25917v1 Announce Type: cross
Abstract: Recursive or looped language models have recently emerged as a new scaling axis by iteratively refining the same model computation over latent states to deepen reasoning. We extend such scaling princip…

cs.CL, cs.HC

A Survey on LLM-based Conversational User Simulation

/ April 29, 2026

arXiv:2604.24977v1 Announce Type: new
Abstract: User simulation has long played a vital role in computer science due to its potential to support a wide range of applications. Language, as the primary medium of human communication, forms the foundation…

cs.AI, cs.CL, cs.SE

BenchGuard: Who Guards the Benchmarks? Automated Auditing of LLM Agent Benchmarks

/ April 29, 2026

arXiv:2604.24955v1 Announce Type: new
Abstract: As benchmarks grow in complexity, many apparent agent failures are not failures of the agent at all – they are failures of the benchmark itself: broken specifications, implicit assumptions, and rigid eva…

cs.CL, cs.SE

Don’t Stop Early: Scalable Enterprise Deep Research with Controlled Information Flow and Evidence-Aware Termination

/ April 29, 2026

arXiv:2604.24978v1 Announce Type: new
Abstract: Enterprise deep research often fails to produce decision-ready reports due to uneven information coverage, context explosion, and premature stopping. We propose a scalable Enterprise Deep Research (EDR) …

cs.CL, cs.CV, cs.RO

MiMo-Embodied: X-Embodied Foundation Model Technical Report

/ April 29, 2026

arXiv:2511.16518v2 Announce Type: replace-cross
Abstract: We open-source MiMo-Embodied, the first cross-embodied foundation model to successfully integrate and achieve state-of-the-art performance in both Autonomous Driving and Embodied AI. MiMo-Embod…

cs.AI, cs.CL, cs.LG

PermaFrost-Attack: Stealth Pretraining Seeding (SPS) for Planting Logic Landmines During LLM Training

/ April 29, 2026

arXiv:2604.22117v2 Announce Type: replace-cross
Abstract: Aligned large language models (LLMs) remain vulnerable to adversarial manipulation, and their reliance on web-scale pretraining creates a subtle but consequential attack surface. We study Steal…