Provide.ai - We Provide AI To Companies

ClawsBench: Evaluating Capability and Safety of LLM Productivity Agents in Simulated Workspaces

/ April 9, 2026

arXiv:2604.05172v2 Announce Type: replace
Abstract: Large language model (LLM) agents are increasingly deployed to automate productivity tasks (e.g., email, scheduling, document management), but evaluating them on live services is risky due to potenti…

cs.AI, cs.CL

FBS: Modeling Native Parallel Reading inside a Transformer

Tongxi Wang / April 9, 2026

arXiv:2601.21708v2 Announce Type: replace
Abstract: Large language models (LLMs) excel across many tasks, yet inference is still dominated by strictly token-by-token autoregression. Existing acceleration methods largely patch this pipeline and miss co…

cs.AI, cs.CL, cs.DL, cs.IR

Knowledge Graphs Generation from Cultural Heritage Texts: Combining LLMs and Ontological Engineering for Scholarly Debates

Andrea Schimmenti, Valentina Pasqual, Fabio Vitali, Marieke van Erp / April 9, 2026

arXiv:2511.10354v1 Announce Type: cross
Abstract: Cultural Heritage texts contain rich knowledge that is difficult to query systematically due to the challenges of converting unstructured discourse into structured Knowledge Graphs (KGs). This paper in…

cs.AI, cs.CL, cs.CV

UI-AGILE: Advancing GUI Agents with Effective Reinforcement Learning and Precise Inference-Time Grounding

/ April 9, 2026

arXiv:2507.22025v4 Announce Type: replace
Abstract: The emergence of Multimodal Large Language Models (MLLMs) has driven significant advances in Graphical User Interface (GUI) agent capabilities. Nevertheless, existing GUI agent training and inference…

cs.AI, cs.CV

Implantable Adaptive Cells: A Novel Enhancement for Pre-Trained U-Nets in Medical Image Segmentation

Emil Benedykciuk, Marcin Denkowski, Grzegorz W\'ojcik / April 9, 2026

arXiv:2405.03420v2 Announce Type: cross
Abstract: This paper introduces a novel approach to enhance the performance of pre-trained neural networks in medical image segmentation using gradient-based Neural Architecture Search (NAS) methods. We present …

cs.AI, cs.CL

How Much LLM Does a Self-Revising Agent Actually Need?

Seongwoo Jeong, Seonil Son / April 9, 2026

arXiv:2604.07236v1 Announce Type: new
Abstract: Recent LLM-based agents often place world modeling, planning, and reflection inside a single language model loop. This can produce capable behavior, but it makes a basic scientific question difficult to …

cs.AI, cs.CL, cs.LG

A Systematic Study of Retrieval Pipeline Design for Retrieval-Augmented Medical Question Answering

Nusrat Sultana, Abdullah Muhammad Moosa, Kazi Afzalur Rahman, Sajal Chandra Banik / April 9, 2026

arXiv:2604.07274v1 Announce Type: cross
Abstract: Large language models (LLMs) have demonstrated strong capabilities in medical question answering; however, purely parametric models often suffer from knowledge gaps and limited factual grounding. Retri…

cs.AI, cs.LG

Reason in Chains, Learn in Trees: Self-Rectification and Grafting for Multi-turn Agent Policy Optimization

Yu Li, Sizhe Tang, Tian Lan / April 9, 2026

arXiv:2604.07165v1 Announce Type: new
Abstract: Reinforcement learning for Large Language Model agents is often hindered by sparse rewards in multi-step reasoning tasks. Existing approaches like Group Relative Policy Optimization treat sampled traject…

cs.AI, cs.CV

Energy-based Tissue Manifolds for Longitudinal Multiparametric MRI Analysis

/ April 9, 2026

arXiv:2604.07180v1 Announce Type: cross
Abstract: We propose a geometric framework for longitudinal multi-parametric MRI analysis based on patient-specific energy modelling in sequence space. Rather than operating on images with spatial networks, each…

cs.AI, cs.LG

EVGeoQA: Benchmarking LLMs on Dynamic, Multi-Objective Geo-Spatial Exploration

Jianfei Wu, Zhichun Wang, Zhensheng Wang, Zhiyu He / April 9, 2026

arXiv:2604.07070v1 Announce Type: new
Abstract: While Large Language Models (LLMs) demonstrate remarkable reasoning capabilities, their potential for purpose-driven exploration in dynamic geo-spatial environments remains under-investigated. Existing G…