- Provide.ai - Page 60

Beyond Sentiment: A Multi-Agent Pipeline for Actionable Business Advice from Reviews

/ May 5, 2026

arXiv:2601.12024v2 Announce Type: replace
Abstract: Customer reviews contain valuable signals about service quality, but converting large-scale review corpora into actionable business recommendations remains difficult. Standard sentiment/aspect analys…

cs.AI, cs.CL

AgentXRay: White-Boxing Agentic Systems via Workflow Reconstruction

/ May 5, 2026

arXiv:2602.05353v3 Announce Type: replace
Abstract: Large Language Models have shown strong capabilities in complex problem solving, yet many agentic systems remain difficult to interpret and control due to opaque internal workflows. While some framew…

cs.CL

The Cylindrical Representation Hypothesis for Language Model Steering

/ May 5, 2026

arXiv:2605.01844v1 Announce Type: new
Abstract: Steering is a widely used technique for controlling large language models, yet its effects are often unstable and hard to predict. Existing theoretical accounts are largely based on the Linear Representa…

cs.CL, cs.CR

Watermarking LLM Agent Trajectories

/ May 5, 2026

arXiv:2602.18700v2 Announce Type: replace-cross
Abstract: LLM agents rely heavily on high-quality trajectory data to guide their problem-solving behaviors, yet producing such data requires substantial task design, high-capacity model generation, and m…

cs.CL, cs.CV

MIRL: Mutual Information-Guided Reinforcement Learning for Vision-Language Models

/ May 5, 2026

arXiv:2605.01520v1 Announce Type: cross
Abstract: Vision-Language Models (VLMs) frequently suffer from visual perception errors and hallucinations that compromise answer accuracy in complex reasoning tasks. Reinforcement Learning with Verifiable Rewar…

cs.AI, cs.CL, cs.LG

VeRO: An Evaluation Harness for Agents to Optimize Agents

/ May 5, 2026

arXiv:2602.22480v2 Announce Type: replace
Abstract: An important emerging application of coding agents is agent optimization: the iterative improvement of a target agent through edit-execute-evaluate cycles. Despite its relevance, the community lacks …

cs.CL, cs.IR

Led to Mislead: Adversarial Content Injection for Attacks on Neural Ranking Models

/ May 5, 2026

arXiv:2605.01591v1 Announce Type: cross
Abstract: Neural Ranking Models (NRMs) are central to modern information retrieval but remain highly vulnerable to adversarial manipulation. Existing attacks often rely on heuristics or surrogate models, limitin…

cs.CL, cs.CV

Medical thinking with multiple images

/ May 5, 2026

arXiv:2604.16506v2 Announce Type: replace-cross
Abstract: Large language models perform well on many medical QA benchmarks, but real clinical reasoning often requires integrating evidence across multiple images rather than interpreting a single view. …

cs.AI, cs.RO

VILAS: A VLA-Integrated Low-cost Architecture with Soft Grasping for Robotic Manipulation

/ May 5, 2026

arXiv:2605.02037v1 Announce Type: cross
Abstract: We present VILAS, a fully low-cost, modular robotic manipulation platform designed to support end-to-end vision-language-action (VLA) policy learning and deployment on accessible hardware. The system i…

cs.CV, cs.RO

VoxAfford: Multi-Scale Voxel-Token Fusion for Open-Vocabulary 3D Affordance Detection

/ May 5, 2026

arXiv:2605.01365v1 Announce Type: cross
Abstract: Open-vocabulary 3D affordance detection requires localizing interaction regions on point clouds given novel affordance descriptions. Recent methods extend multimodal large language models (MLLMs) with …