- Provide.ai - Page 333

MemGround: Long-Term Memory Evaluation Kit for Large Language Models in Gamified Scenarios

/ April 17, 2026

arXiv:2604.14158v1 Announce Type: new
Abstract: Current evaluations of long-term memory in LLMs are fundamentally static. By fixating on simple retrieval and short-context inference, they neglect the multifaceted nature of complex memory systems, such…

cs.AI, cs.CL, cs.IR

IG-Search: Step-Level Information Gain Rewards for Search-Augmented Reasoning

/ April 17, 2026

arXiv:2604.15148v1 Announce Type: cross
Abstract: Reinforcement learning has emerged as an effective paradigm for training large language models to perform search-augmented reasoning. However, existing approaches rely on trajectory-level rewards that …

cs.CL

Foresight Optimization for Strategic Reasoning in Large Language Models

/ April 17, 2026

arXiv:2604.13592v2 Announce Type: replace
Abstract: Reasoning capabilities in large language models (LLMs) have generally advanced significantly. However, it is still challenging for existing reasoning-based LLMs to perform effective decision-making a…

cs.AI, cs.CL

MARS$^2$: Scaling Multi-Agent Tree Search via Reinforcement Learning for Code Generation

/ April 17, 2026

arXiv:2604.14564v1 Announce Type: cross
Abstract: Reinforcement learning (RL) paradigms have demonstrated strong performance on reasoning-intensive tasks such as code generation. However, limited trajectory diversity often leads to diminishing returns…

cs.CL

Your LLM Agents are Temporally Blind: The Misalignment Between Tool Use Decisions and Human Time Perception

/ April 17, 2026

arXiv:2510.23853v3 Announce Type: replace
Abstract: Large language model (LLM) agents are increasingly used to interact with and execute tasks in dynamic environments. However, a critical yet overlooked limitation of these agents is that they, by defa…

cs.CL, cs.IR

ProRank: Prompt Warmup via Reinforcement Learning for Small Language Models Reranking

/ April 17, 2026

arXiv:2506.03487v3 Announce Type: replace-cross
Abstract: Reranking is fundamental to information retrieval and retrieval-augmented generation, with recent Large Language Models (LLMs) significantly advancing reranking quality. Most current works rely…

cs.CL

Challenges in Translating Technical Lectures: Insights from the NPTEL

/ April 17, 2026

arXiv:2602.08698v2 Announce Type: replace
Abstract: This study examines the practical applications and methodological implications of Machine Translation in Indian Languages, specifically Bangla, Malayalam, and Telugu, within emerging translation work…

cs.AI, cs.CL, cs.MA

TopoDIM: One-shot Topology Generation of Diverse Interaction Modes for Multi-Agent Systems

/ April 17, 2026

arXiv:2601.10120v2 Announce Type: replace-cross
Abstract: Optimizing communication topology in LLM-based multi-agent system is critical for enabling collective intelligence. Existing methods mainly rely on spatio-temporal interaction paradigms, where …

cs.AI, cs.CL

METER: Evaluating Multi-Level Contextual Causal Reasoning in Large Language Models

/ April 17, 2026

arXiv:2604.11502v2 Announce Type: replace
Abstract: Contextual causal reasoning is a critical yet challenging capability for Large Language Models (LLMs). Existing benchmarks, however, often evaluate this skill in fragmented settings, failing to ensur…

cs.AI, cs.CL

ReviewGrounder: Improving Review Substantiveness with Rubric-Guided, Tool-Integrated Agents

/ April 17, 2026

arXiv:2604.14261v1 Announce Type: new
Abstract: The rapid rise in AI conference submissions has driven increasing exploration of large language models (LLMs) for peer review support. However, LLM-based reviewers often generate superficial, formulaic c…