Estimating Tail Risks in Language Model Output Distributions

April 27, 2026

arXiv:2604.22167v1 Announce Type: cross
Abstract: Language models are increasingly capable and are being rapidly deployed on a population-level scale. As a result, the safety of these models is increasingly high-stakes. Fortunately, advances in alignm…

cs.AI, cs.IR

ResRank: Unifying Retrieval and Listwise Reranking via End-to-End Joint Training with Residual Passage Compression

April 27, 2026

arXiv:2604.22180v1 Announce Type: cross
Abstract: Large language model (LLM) based listwise reranking has emerged as the dominant paradigm for achieving state-of-the-art ranking effectiveness in information retrieval. However, its reliance on feeding …

cs.AI, cs.CL

When Models Outthink Their Safety: Unveiling and Mitigating Self-Jailbreak in Large Reasoning Models

April 27, 2026

arXiv:2510.21285v4 Announce Type: replace
Abstract: Large Reasoning Models (LRMs) achieve strong performance on complex multi-step reasoning, yet they still exhibit severe safety failures such as harmful content generation. Existing methods often appl…

cs.AI, cs.CL, cs.SE

Evaluating LLM-Based Goal Extraction in Requirements Engineering: Prompting Strategies and Their Limitations

April 27, 2026

arXiv:2604.22207v1 Announce Type: cross
Abstract: Due to the textual and repetitive nature of many Requirements Engineering (RE) artefacts, Large Language Models (LLMs) have proven useful to automate their generation and processing. In this paper, we …

cs.AI, cs.CL, cs.SD, eess.AS

UniSonate: A Unified Model for Speech, Music, and Sound Effect Generation with Text Instructions

April 27, 2026

arXiv:2604.22209v1 Announce Type: cross
Abstract: Generative audio modeling has largely been fragmented into specialized tasks, text-to-speech (TTS), text-to-music (TTM), and text-to-audio (TTA), each operating under heterogeneous control paradigms. U…

cs.AI, cs.CV

PSI: A Benchmark for Human Interpretation and Response in Traffic Interactions

April 27, 2026

arXiv:2112.02604v3 Announce Type: replace-cross
Abstract: Accurately modeling pedestrian intention and understanding driver decision-making processes are critical for the development of safe and socially aware autonomous driving systems. We introduce …

cs.CL

Beyond N-gram: Data-Aware X-GRAM Extraction for Efficient Embedding Parameter Scaling

April 27, 2026

arXiv:2604.21724v2 Announce Type: replace
Abstract: Large token-indexed lookup tables provide a compute-decoupled scaling path, but their practical gains are often limited by poor parameter efficiency and rapid memory growth. We attribute these limita…

cs.CL

DimABSA: Building Multilingual and Multidomain Datasets for Dimensional Aspect-Based Sentiment Analysis

April 27, 2026

arXiv:2601.23022v3 Announce Type: replace
Abstract: Aspect-Based Sentiment Analysis (ABSA) focuses on extracting sentiment at a fine-grained aspect level and has been widely applied across real-world domains. However, existing ABSA research relies on …

cs.CV

CharTide: Data-Centric Chart-to-Code Generation via Tri-Perspective Tuning and Inquiry-Driven Evolution

April 27, 2026

arXiv:2604.22192v1 Announce Type: new
Abstract: Chart-to-code generation demands strict visual precision and syntactic correctness from Vision-Language Models (VLMs). However, existing approaches are fundamentally constrained by data-centric limitatio…

cs.CV

OccDirector: Language-Guided Behavior and Interaction Generation in 4D Occupancy Space

April 27, 2026

arXiv:2604.22240v1 Announce Type: new
Abstract: Generative world models increasingly rely on 4D occupancy for realistic autonomous driving simulation. However, existing generation frameworks depend on rigid geometric conditions (e.g., explicit traject…