Provide.ai - Page 56

Reward Modeling from Natural Language Human Feedback

/ May 4, 2026

arXiv:2601.07349v3 Announce Type: replace
Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) on preference data has become the mainstream approach for training Generative Reward Models (GRMs). Typically in pairwise rewarding tasks, GRMs ge…

cs.CV

PhysiGen: Integrating Collision-Aware Physical Constraints for High-Fidelity Human-Human Interaction Generation

/ May 4, 2026

arXiv:2605.00517v1 Announce Type: new
Abstract: Despite substantial progress in text-driven 3D human motion synthesis, generating realistic multi-person interaction sequences remains challenging. Notably, body inter-penetration is a pervasive issue fr…

cs.CL

BanglaSocialBench: A Benchmark for Evaluating Sociopragmatic and Cultural Alignment of LLMs in Bangladeshi Social Interaction

/ May 4, 2026

arXiv:2603.15949v3 Announce Type: replace
Abstract: Large Language Models have demonstrated strong multilingual fluency, yet fluency alone does not guarantee socially appropriate language use. In high-context languages, communicative competence requir…

cs.CV, cs.LG

Vesselpose: Vessel Graph Reconstruction from Learned Voxel-wise Direction Vectors in 3D Vascular Images

/ May 4, 2026

arXiv:2605.00538v1 Announce Type: new
Abstract: Blood vessel segmentation and tracing are essential tasks in many medical imaging applications. Although numerous methods exist, the prevailing segment-then-fix paradigm is fundamentally limited regardi…

cs.CL

SCOPE: Planning for Hybrid Querying over Clinical Trial Data

/ May 4, 2026

arXiv:2604.25120v2 Announce Type: replace
Abstract: We study clinical trial table reasoning, where answers are not directly stored in visible cells but must be reasoned from semantic understanding through normalization, classification, extraction, or …

cs.AI, cs.LG, stat.ML

Position: agentic AI orchestration should be Bayes-consistent

/ May 4, 2026

arXiv:2605.00742v1 Announce Type: new
Abstract: LLMs excel at predictive tasks and complex reasoning tasks, but many high-value deployments rely on decisions under uncertainty, for example, which tool to call, which expert to consult, or how many reso…

cs.AI, cs.CL, cs.LG

Odysseus: Scaling VLMs to 100+ Turn Decision-Making in Games via Reinforcement Learning

/ May 4, 2026

arXiv:2605.00347v1 Announce Type: cross
Abstract: Given the rapidly growing capabilities of vision-language models (VLMs), extending them to interactive decision-making tasks such as video games has emerged as a promising frontier. However, existing a…

Artificial Intelligence, Identity and Access Management, Security

AI agents can bypass guardrails and put credentials at risk, Okta study finds

/ May 1, 2026

An AI agent that revealed sensitive data without being asked. An agent that overruled its own guardrails. Another that sent credentials to an attacker via Telegram, because it forgot it wasn’t supposed to do so after a reset.

…

cs.CV

Action Motifs: Self-Supervised Hierarchical Representation of Human Body Movements

/ May 1, 2026

arXiv:2604.28173v1 Announce Type: new
Abstract: Effective human behavior modeling requires a representation of the human body movement that capitalizes on its compositionality. We propose a hierarchical representation consisting of Action Atoms that c…

cs.CV, cs.CY

AEGIS: A Holistic Benchmark for Evaluating Forensic Analysis of AI-Generated Academic Images

/ May 1, 2026

arXiv:2604.28177v1 Announce Type: new
Abstract: We introduce AEGIS, A holistic benchmark for Evaluating forensic analysis of AI-Generated academic ImageS. Compared to existing benchmarks, AEGIS features three key advances: (1) Domain-Specific Complexi…