- Provide.ai - Page 294

Vision Hopfield Memory Networks

/ March 27, 2026

arXiv:2603.25157v1 Announce Type: cross
Abstract: Recent vision and multimodal foundation backbones, such as Transformer families and state-space models like Mamba, have achieved remarkable progress, enabling unified modeling across images, text, and …

cs.CV, cs.RO

Diagnose, Correct, and Learn from Manipulation Failures via Visual Symbols

/ March 27, 2026

arXiv:2512.02787v3 Announce Type: replace
Abstract: Vision-Language-Action (VLA) models have recently achieved remarkable progress in robotic manipulation, yet they remain limited in failure diagnosis and learning from failures. Additionally, existing…

cs.CL, cs.HC

Machine Learning for Enhancing Deliberation in Online Political Discussions and Participatory Processes: A Survey

/ March 27, 2026

arXiv:2506.02533v2 Announce Type: replace
Abstract: Political online participation in the form of discussing political issues and exchanging opinions among citizens is gaining importance with more and more formats being held digitally. To come to a de…

cs.CL

Can GRPO Boost Complex Multimodal Table Understanding?

/ March 27, 2026

arXiv:2509.16889v3 Announce Type: replace
Abstract: Existing table understanding methods face challenges due to complex table structures and intricate logical reasoning. While supervised finetuning (SFT) dominates existing research, reinforcement lear…

cs.AI, cs.CL

DiffuGuard: How Intrinsic Safety is Lost and Found in Diffusion Large Language Models

/ March 27, 2026

arXiv:2509.24296v2 Announce Type: replace
Abstract: The rapid advancement of Diffusion Large Language Models (dLLMs) introduces unprecedented vulnerabilities that are fundamentally distinct from Autoregressive LLMs, stemming from their iterative and p…

cs.CL

Self-Improvement of Large Language Models: A Technical Overview and Future Outlook

/ March 27, 2026

arXiv:2603.25681v1 Announce Type: new
Abstract: As large language models (LLMs) continue to advance, improving them solely through human supervision is becoming increasingly costly and limited in scalability. As models approach human-level capabilitie…

cs.CL

CNSocialDepress: A Chinese Social Media Dataset for Depression Risk Detection and Structured Analysis

/ March 27, 2026

arXiv:2510.11233v2 Announce Type: replace
Abstract: Depression is a pressing global public health issue, yet publicly available Chinese-language resources for depression risk detection remain scarce and largely focus on binary classification. To addre…

cs.AI, cs.CL

A cross-species neural foundation model for end-to-end speech decoding

/ March 27, 2026

arXiv:2511.21740v4 Announce Type: replace
Abstract: Speech brain-computer interfaces (BCIs) aim to restore communication for people with paralysis by translating neural activity into text. Most systems use cascaded frameworks that decode phonemes befo…

cs.CL

From Evidence-Based Medicine to Knowledge Graph: Retrieval-Augmented Generation for Sports Rehabilitation and a Domain Benchmark

/ March 27, 2026

arXiv:2601.00216v2 Announce Type: replace
Abstract: Current medical retrieval-augmented generation (RAG) approaches overlook evidence-based medicine (EBM) principles, leading to two key gaps: (1) the lack of PICO alignment between queries and retrieve…

cs.CL, cs.LG

Training LLMs for Multi-Step Tool Orchestration with Constrained Data Synthesis and Graduated Rewards

/ March 27, 2026

arXiv:2603.24709v1 Announce Type: cross
Abstract: Multi-step tool orchestration, where LLMs must invoke multiple dependent APIs in the correct order while propagating intermediate outputs, remains challenging. State-of-the-art models frequently fail o…