Provide.ai - Page 100

UniSonate: A Unified Model for Speech, Music, and Sound Effect Generation with Text Instructions

/ April 27, 2026

arXiv:2604.22209v1 Announce Type: cross
Abstract: Generative audio modeling has largely been fragmented into specialized tasks, text-to-speech (TTS), text-to-music (TTM), and text-to-audio (TTA), each operating under heterogeneous control paradigms. U…

cs.CV

Uni-Encoder Meets Multi-Encoders: Representation Before Fusion for Brain Tumor Segmentation with Missing Modalities

/ April 27, 2026

arXiv:2604.22177v1 Announce Type: new
Abstract: Multimodal MRI offers complementary information for brain tumor segmentation, but clinical scans often lack one or more modalities, which degrades segmentation performance. In this paper, we propose UniM…

cs.AI, cs.CV

PSI: A Benchmark for Human Interpretation and Response in Traffic Interactions

/ April 27, 2026

arXiv:2112.02604v3 Announce Type: replace-cross
Abstract: Accurately modeling pedestrian intention and understanding driver decision-making processes are critical for the development of safe and socially aware autonomous driving systems. We introduce …

cs.CL

Beyond N-gram: Data-Aware X-GRAM Extraction for Efficient Embedding Parameter Scaling

/ April 27, 2026

arXiv:2604.21724v2 Announce Type: replace
Abstract: Large token-indexed lookup tables provide a compute-decoupled scaling path, but their practical gains are often limited by poor parameter efficiency and rapid memory growth. We attribute these limita…

cs.LG

Toward Robust and Efficient ML-Based GPU Caching for Modern Inference

/ April 27, 2026

arXiv:2509.20979v2 Announce Type: replace
Abstract: In modern GPU inference, cache efficiency remains a major bottleneck, and heuristic policies such as LRU can perform far worse than the offline optimum. Existing learning-based caching syste…

cs.CL

DimABSA: Building Multilingual and Multidomain Datasets for Dimensional Aspect-Based Sentiment Analysis

/ April 27, 2026

arXiv:2601.23022v3 Announce Type: replace
Abstract: Aspect-Based Sentiment Analysis (ABSA) focuses on extracting sentiment at a fine-grained aspect level and has been widely applied across real-world domains. However, existing ABSA research relies on …

cs.CV

CharTide: Data-Centric Chart-to-Code Generation via Tri-Perspective Tuning and Inquiry-Driven Evolution

/ April 27, 2026

arXiv:2604.22192v1 Announce Type: new
Abstract: Chart-to-code generation demands strict visual precision and syntactic correctness from Vision-Language Models (VLMs). However, existing approaches are fundamentally constrained by data-centric limitatio…

cs.LG

Leveraging Teleconnections with Physics-Informed Graph Attention Networks for Long-Range Extreme Rainfall Forecasting in Thailand

/ April 27, 2026

arXiv:2510.12328v5 Announce Type: replace
Abstract: Accurate rainfall forecasting, particularly for extreme events, remains a significant challenge in climatology and the Earth system. This paper presents novel physics-informed Graph Neural Networks (…

cs.LG

A Nationwide Japanese Medical Claims Foundation Model: Balancing Model Scaling and Task-Specific Computational Efficiency

/ April 27, 2026

arXiv:2604.22348v1 Announce Type: new
Abstract: Clinical risk prediction using longitudinal medical data supports individualized care. Self-supervised foundation models have emerged as a promising approach for leveraging large-scale unlabeled healthca…

cs.CV

OccDirector: Language-Guided Behavior and Interaction Generation in 4D Occupancy Space

/ April 27, 2026

arXiv:2604.22240v1 Announce Type: new
Abstract: Generative world models increasingly rely on 4D occupancy for realistic autonomous driving simulation. However, existing generation frameworks depend on rigid geometric conditions (e.g., explicit traject…