Learning-Based Automated Adversarial Red-Teaming for Robustness Evaluation of Large Language Models

/ April 29, 2026

arXiv:2512.20677v4 Announce Type: replace-cross
Abstract: The increasing deployment of large language models (LLMs) in safety-critical applications raises fundamental challenges in systematically evaluating robustness against adversarial behaviors. Ex…

cs.AI, cs.CL, cs.SE

BenchGuard: Who Guards the Benchmarks? Automated Auditing of LLM Agent Benchmarks

/ April 29, 2026

arXiv:2604.24955v1 Announce Type: new
Abstract: As benchmarks grow in complexity, many apparent agent failures are not failures of the agent at all – they are failures of the benchmark itself: broken specifications, implicit assumptions, and rigid eva…

cs.CL, cs.SE

Don’t Stop Early: Scalable Enterprise Deep Research with Controlled Information Flow and Evidence-Aware Termination

/ April 29, 2026

arXiv:2604.24978v1 Announce Type: new
Abstract: Enterprise deep research often fails to produce decision-ready reports due to uneven information coverage, context explosion, and premature stopping. We propose a scalable Enterprise Deep Research (EDR) …

cs.CL, cs.CV, cs.RO

MiMo-Embodied: X-Embodied Foundation Model Technical Report

/ April 29, 2026

arXiv:2511.16518v2 Announce Type: replace-cross
Abstract: We open-source MiMo-Embodied, the first cross-embodied foundation model to successfully integrate and achieve state-of-the-art performance in both Autonomous Driving and Embodied AI. MiMo-Embod…

cs.AI, cs.CL, cs.LG

PermaFrost-Attack: Stealth Pretraining Seeding (SPS) for planting Logic Landmines During LLM Training

/ April 29, 2026

arXiv:2604.22117v2 Announce Type: replace-cross
Abstract: Aligned large language models (LLMs) remain vulnerable to adversarial manipulation, and their reliance on web-scale pretraining creates a subtle but consequential attack surface. We study Steal…

cs.AI, cs.CL

Large Language Models Are Effective Human Annotation Assistants, But Not Good Independent Annotators

/ April 29, 2026

arXiv:2503.06778v3 Announce Type: replace
Abstract: Event annotation is important for identifying market changes, monitoring breaking news, and understanding sociological trends. Although expert annotators set the gold standards, human coding is expen…

cs.CL

OMHBench: Benchmarking Balanced and Grounded Omni-Modal Multi-Hop Reasoning

/ April 29, 2026

arXiv:2508.16198v3 Announce Type: replace
Abstract: Multimodal Large Language Models (MLLMs) have increasingly supported omni-modal processing across text, vision, and speech. However, existing evaluation frameworks for such models suffer from critica…

cs.AI, cs.CL

Analyzing LLM Reasoning to Uncover Mental Health Stigma

/ April 29, 2026

arXiv:2604.25053v1 Announce Type: new
Abstract: While large language models (LLMs) are increasingly being explored for mental health applications, recent studies reveal that they can exhibit stigma toward individuals with psychological conditions. Exi…

cs.CL, cs.HC

The Dynamics of Delusion: Modeling Bidirectional False Belief Amplification in Human-Chatbot Dialogue

/ April 29, 2026

arXiv:2604.25096v1 Announce Type: new
Abstract: There is growing concern that AI chatbots might fuel delusional beliefs in users. Some have suggested that humans and chatbots mutually reinforce false beliefs over time, but quantitative evidence is lac…

cs.CL

jina-embeddings-v5-text: Task-Targeted Embedding Distillation

/ April 29, 2026

arXiv:2602.15547v2 Announce Type: replace
Abstract: Text embedding models are widely used for semantic similarity tasks, including information retrieval, clustering, and classification. General-purpose models are typically trained with single- or mult…