Reason Only When Needed: Efficient Generative Reward Modeling via Model-Internal Uncertainty

/ April 17, 2026

arXiv:2604.10072v2 Announce Type: replace
Abstract: Recent advancements in the Generative Reward Model (GRM) have demonstrated its potential to enhance the reasoning abilities of LLMs through Chain-of-Thought (CoT) prompting. Despite these gains, exis…

cs.CL

Who Wrote This Line? Evaluating the Detection of LLM-Generated Classical Chinese Poetry

/ April 17, 2026

arXiv:2604.10101v2 Announce Type: replace
Abstract: The rapid development of large language models (LLMs) has extended text generation tasks into the literary domain. However, AI-generated literary creations have raised increasingly prominent issues of…

cs.CL

Foresight Optimization for Strategic Reasoning in Large Language Models

/ April 17, 2026

arXiv:2604.13592v2 Announce Type: replace
Abstract: Reasoning capabilities in large language models (LLMs) have generally advanced significantly. However, it is still challenging for existing reasoning-based LLMs to perform effective decision-making a…

cs.AI, cs.CL

MARS$^2$: Scaling Multi-Agent Tree Search via Reinforcement Learning for Code Generation

/ April 17, 2026

arXiv:2604.14564v1 Announce Type: cross
Abstract: Reinforcement learning (RL) paradigms have demonstrated strong performance on reasoning-intensive tasks such as code generation. However, limited trajectory diversity often leads to diminishing returns…

cs.CL

Your LLM Agents are Temporally Blind: The Misalignment Between Tool Use Decisions and Human Time Perception

/ April 17, 2026

arXiv:2510.23853v3 Announce Type: replace
Abstract: Large language model (LLM) agents are increasingly used to interact with and execute tasks in dynamic environments. However, a critical yet overlooked limitation of these agents is that they, by defa…

cs.CL, cs.IR

ProRank: Prompt Warmup via Reinforcement Learning for Small Language Models Reranking

/ April 17, 2026

arXiv:2506.03487v3 Announce Type: replace-cross
Abstract: Reranking is fundamental to information retrieval and retrieval-augmented generation, with recent Large Language Models (LLMs) significantly advancing reranking quality. Most current works rely…

cs.CL

Challenges in Translating Technical Lectures: Insights from the NPTEL

/ April 17, 2026

arXiv:2602.08698v2 Announce Type: replace
Abstract: This study examines the practical applications and methodological implications of Machine Translation in Indian Languages, specifically Bangla, Malayalam, and Telugu, within emerging translation work…

cs.AI, cs.CL, cs.MA

TopoDIM: One-shot Topology Generation of Diverse Interaction Modes for Multi-Agent Systems

/ April 17, 2026

arXiv:2601.10120v2 Announce Type: replace-cross
Abstract: Optimizing communication topology in LLM-based multi-agent systems is critical for enabling collective intelligence. Existing methods mainly rely on spatio-temporal interaction paradigms, where …

cs.AI, cs.CL

METER: Evaluating Multi-Level Contextual Causal Reasoning in Large Language Models

/ April 17, 2026

arXiv:2604.11502v2 Announce Type: replace
Abstract: Contextual causal reasoning is a critical yet challenging capability for Large Language Models (LLMs). Existing benchmarks, however, often evaluate this skill in fragmented settings, failing to ensur…

cs.AI, cs.CL

ReviewGrounder: Improving Review Substantiveness with Rubric-Guided, Tool-Integrated Agents

/ April 17, 2026

arXiv:2604.14261v1 Announce Type: new
Abstract: The rapid rise in AI conference submissions has driven increasing exploration of large language models (LLMs) for peer review support. However, LLM-based reviewers often generate superficial, formulaic c…