- Provide.ai - Page 102

StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction

/ May 8, 2026

arXiv:2605.06642v1 Announce Type: new
Abstract: Large language models (LLMs) are increasingly used as interactive agents, but optimizing them for long-horizon decision making remains difficult because current methods are largely purely reactive, which…

cs.LG

Knowing but Not Correcting: Routine Task Requests Suppress Factual Correction in LLMs

/ May 8, 2026

arXiv:2605.05957v1 Announce Type: new
Abstract: LLMs reliably correct false claims when presented in isolation, yet when the same claims are embedded in task-oriented requests, they often comply rather than correct. We term this failure mode \emph{cor…

cs.AI, cs.CV, cs.LG

ActCam: Zero-Shot Joint Camera and 3D Motion Control for Video Generation

/ May 8, 2026

arXiv:2605.06667v1 Announce Type: cross
Abstract: For artistic applications, video generation requires fine-grained control over both performance and cinematography, i.e., the actor’s motion and the camera trajectory. We present ActCam, a zero-shot me…

cs.CL

KORE: Enhancing Knowledge Injection for Large Multimodal Models via Knowledge-Oriented Controls

/ May 8, 2026

arXiv:2510.19316v2 Announce Type: replace
Abstract: Large Multimodal Models encode extensive factual knowledge in their pre-trained weights. However, its knowledge remains static and limited, unable to keep pace with real-world developments, which hin…

cs.AI

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

/ May 8, 2026

arXiv:2605.06130v1 Announce Type: new
Abstract: A persistent skill library allows language model agents to reuse successful strategies across tasks. Maintaining such a library requires three coupled capabilities. The agent selects a relevant skill, ut…

cs.AI, cs.CL, eess.AS

WavCube: Unifying Speech Representation for Understanding and Generation via Semantic-Acoustic Joint Modeling

/ May 8, 2026

arXiv:2605.06407v1 Announce Type: cross
Abstract: Integrating speech understanding and generation is a pivotal step toward building unified speech models. However, the different representations required for these two tasks currently pose significant c…

cs.LG

Efficient Serving for Dynamic Agent Workflows with Prediction-based KV-Cache Management

/ May 8, 2026

arXiv:2605.06472v1 Announce Type: new
Abstract: LLM-based workflows compose specialized agents to execute complex tasks, and these agents usually share substantial context, allowing KV-Cache reuse to save computation. Existing approaches either manage…

Data Privacy, Laws and Regulations, LinkedIn, Privacy, Security

LinkedIn illegally blocking free accounts from seeing ‘who’s viewed your profile’ data, group alleges

/ May 7, 2026

A LinkedIn feature that allows paid subscribers to view a list of visitors to their profile should be made available to all EU users free of charge to comply with the region’s General Data Protection Regulation (GDPR), a legal co…

cs.AI, cs.SE

ProgramBench: Can Language Models Rebuild Programs From Scratch?

/ May 7, 2026

arXiv:2605.03546v1 Announce Type: cross
Abstract: Turning ideas into full software projects from scratch has become a popular use case for language models. Agents are being deployed to seed, maintain, and grow codebases over extended periods with mini…

cs.AI, cs.LG, cs.SD, eess.SP

PHALAR: Phasors for Learned Musical Audio Representations

/ May 7, 2026

arXiv:2605.03929v2 Announce Type: cross
Abstract: Stem retrieval, the task of matching missing stems to a given audio submix, is a key challenge currently limited by models that discard temporal information. We introduce PHALAR, a contrastive framewor…