OPeRA: A Dataset of Observation, Persona, Rationale, and Action for Evaluating LLMs on Human Online Shopping Behavior Simulation

/ April 21, 2026

arXiv:2506.05606v5 Announce Type: replace
Abstract: Can large language models (LLMs) accurately simulate the next web action of a specific user? While LLMs have shown promising capabilities in generating “believable” human behaviors, evaluating thei…

cs.AI, cs.LG, q-bio.QM

A Systematic Survey and Benchmark of Deep Learning for Molecular Property Prediction in the Foundation Model Era

/ April 21, 2026

arXiv:2604.16586v1 Announce Type: new
Abstract: Molecular property prediction integrates quantum chemistry, cheminformatics, and deep learning to connect molecular structure with physicochemical and biological behavior. This survey traces four complem…

cs.CV, cs.HC

Deep Learning for Virtual Reality User Identification: A Benchmark

/ April 21, 2026

arXiv:2604.16341v1 Announce Type: cross
Abstract: Virtual Reality (VR) applications require robust user identification systems to ensure secure access to equipment and protect worker identities. Motion tracking data from VR headsets and controllers ha…

cs.LG, physics.flu-dyn

A Two-Phase Deep Learning Framework for Adaptive Time-Stepping in High-Speed Flow Modeling

/ April 21, 2026

arXiv:2506.07969v2 Announce Type: replace
Abstract: We consider the problem of modeling high-speed flows using machine learning methods. While most prior studies focus on low-speed fluid flows in which uniform time-stepping is practical, flows approac…

cs.AI, cs.LO, cs.PL

Just Type It in Isabelle! AI Agents Drafting, Mechanizing, and Generalizing from Human Hints

/ April 20, 2026

arXiv:2604.15713v1 Announce Type: cross
Abstract: Type annotations are essential when printing terms in a way that preserves their meaning under reparsing and type inference. We study the problem of complete and minimal type annotations for rank-one p…

cs.AI, cs.SE, quant-ph

A PennyLane-Centric Dataset to Enhance LLM-based Quantum Code Generation using RAG

/ April 20, 2026

arXiv:2503.02497v4 Announce Type: replace-cross
Abstract: Large Language Models (LLMs) offer powerful capabilities in code generation, natural language understanding, and domain-specific reasoning. Their application to quantum software development rem…

cs.AI, cs.CE, cs.DC

cuNNQS-SCI: A Fully GPU-Accelerated Framework for High-Performance Configuration Interaction Selection with Neural Network Quantum States

/ April 20, 2026

arXiv:2604.15768v1 Announce Type: cross
Abstract: AI-driven methods have demonstrated considerable success in tackling the central challenge of accurately solving the Schrödinger equation for complex many-body systems. Among neural network quantum s…

cs.AI, cs.SE

Capture the Flags: Family-Based Evaluation of Agentic LLMs via Semantics-Preserving Transformations

/ April 20, 2026

arXiv:2602.05523v2 Announce Type: replace-cross
Abstract: Agentic large language models (LLMs) are increasingly evaluated on cybersecurity tasks using capture-the-flag (CTF) benchmarks, yet existing pointwise benchmarks offer limited insight into agen…

cs.AI, cs.CR

Security Threat Modeling for Emerging AI-Agent Protocols: A Comparative Analysis of MCP, A2A, Agora, and ANP

/ April 20, 2026

arXiv:2602.11327v2 Announce Type: replace-cross
Abstract: The rapid development of the AI agent communication protocols, including the Model Context Protocol (MCP), Agent2Agent (A2A), Agora, and Agent Network Protocol (ANP), is reshaping how AI agents…

cs.AI, cs.CR

The Blind Spot of Agent Safety: How Benign User Instructions Expose Critical Vulnerabilities in Computer-Use Agents

/ April 20, 2026

arXiv:2604.10577v2 Announce Type: replace-cross
Abstract: Computer-use agents (CUAs) can now autonomously complete complex tasks in real digital environments, but when misled, they can also be used to automate harmful actions programmatically. Existin…