- Provide.ai - Page 427

Redirected, Not Removed: Task-Dependent Stereotyping Reveals the Limits of LLM Alignments

/ April 6, 2026

arXiv:2604.02669v1 Announce Type: new
Abstract: How biased is a language model? The answer depends on how you ask. A model that refuses to choose between castes for a leadership role will, in a fill-in-the-blank task, reliably associate upper castes w…

cs.AI, cs.CL, cs.SE

StructEval: Benchmarking LLMs’ Capabilities to Generate Structural Outputs

/ April 6, 2026

arXiv:2505.20139v3 Announce Type: replace-cross
Abstract: As Large Language Models (LLMs) become integral to software development workflows, their ability to generate structured outputs has become critically important. We introduce StructEval, a compr…

cs.CL

When Modalities Remember: Continual Learning for Multimodal Knowledge Graphs

/ April 6, 2026

arXiv:2604.02778v1 Announce Type: new
Abstract: Real-world multimodal knowledge graphs (MMKGs) are dynamic, with new entities, relations, and multimodal knowledge emerging over time. Existing continual knowledge graph reasoning (CKGR) methods focus on…

cs.CL

LLM-based Atomic Propositions help weak extractors: Evaluation of a Propositioner for triplet extraction

/ April 6, 2026

arXiv:2604.02866v1 Announce Type: new
Abstract: Knowledge Graph construction from natural language requires extracting structured triplets from complex, information-dense sentences. In this paper, we investigate if the decomposition of text into atomi…

cs.AI, cs.CY, cs.HC, cs.MA

When Openclaw Agents Learn from Each Other: Insights from Emergent AI Agent Communities for Human-AI Partnership in Education

/ April 6, 2026

arXiv:2603.16663v3 Announce Type: replace-cross
Abstract: The AIED community envisions AI evolving “from tools to teammates,” yet our understanding of AI teammates remains limited to dyadic human-AI interactions. We offer a different vantage point: a …

cs.AI, cs.CL

LogicPoison: Logical Attacks on Graph Retrieval-Augmented Generation

/ April 6, 2026

arXiv:2604.02954v1 Announce Type: cross
Abstract: Graph-based Retrieval-Augmented Generation (GraphRAG) enhances the reasoning capabilities of Large Language Models (LLMs) by grounding their responses in structured knowledge graphs. Leveraging communi…

cs.CL

Improving LLM First-Token Predictions in Multiple-Choice Question Answering via Output Prefilling

/ April 6, 2026

arXiv:2505.15323v2 Announce Type: replace
Abstract: Large Language Models (LLMs) are increasingly evaluated on multiple-choice question answering (MCQA) tasks using *first-token probability* (FTP), which selects the answer option whose initial token h…

cs.CL

Quick on the Uptake: Eliciting Implicit Intents from Human Demonstrations for Personalized Mobile-Use Agents

/ April 6, 2026

arXiv:2508.08645v2 Announce Type: replace
Abstract: As multimodal large language models advance rapidly, the automation of mobile tasks has become increasingly feasible through the use of mobile-use agents that mimic human interactions from graphical …

cs.AI, cs.CR

Automated Malware Family Classification using Weighted Hierarchical Ensembles of Large Language Models

/ April 6, 2026

arXiv:2604.02490v1 Announce Type: cross
Abstract: Malware family classification remains a challenging task in automated malware analysis, particularly in real-world settings characterized by obfuscation, packing, and rapidly evolving threats. Existing…

cs.CL

VeriOS: Query-Driven Proactive Human-Agent-GUI Interaction for Trustworthy OS Agents

/ April 6, 2026

arXiv:2509.07553v3 Announce Type: replace
Abstract: With the rapid progress of multimodal large language models, operating system (OS) agents become increasingly capable of automating tasks through on-device graphical user interfaces (GUIs). However, …