- Provide.ai - Page 20

Khala: Scaling Acoustic Token Language Models Toward High-Fidelity Music Generation

/ May 6, 2026

arXiv:2605.01790v1 Announce Type: cross
Abstract: A common design pattern in high-quality music generation is to handle structure and fidelity in different representation spaces: a generator first models high-level structure, followed by diffusion-bas…

cs.AI, cs.CY

AcademiClaw: When Students Set Challenges for AI Agents

/ May 6, 2026

arXiv:2605.02661v1 Announce Type: new
Abstract: Benchmarks within the OpenClaw ecosystem have thus far evaluated exclusively assistant-level tasks, leaving the academic-level capabilities of OpenClaw largely unexamined. We introduce AcademiClaw, a bil…

cs.LG, cs.MM

Stable Multimodal Graph Unlearning via Feature-Dimension Aware Quantile Selection

/ May 6, 2026

arXiv:2605.03303v1 Announce Type: new
Abstract: Graph unlearning remains a critical technique for supporting privacy-preserving and sustainable multimodal graph learning. However, we observe that existing unlearning strategies tend to apply uniform pa…

cs.AI, cs.SD

TMD-Bench: A Multi-Level Evaluation Paradigm for Music-Dance Co-Generation

/ May 6, 2026

arXiv:2605.01809v1 Announce Type: cross
Abstract: Unified audio-visual generation is rapidly gaining industrial and creative relevance, enabling applications in virtual production and interactive media. However, when moving from general audio-video sy…

cs.AI

An explainable hypothesis-driven approach to Drug-Induced Liver Injury with HADES

/ May 6, 2026

arXiv:2605.02669v1 Announce Type: new
Abstract: Drug-induced liver injury (DILI) remains a leading cause of late-stage clinical trial attrition. However, existing computational predictors primarily rely on binary classification, a framing that limits …

cs.AI, cs.CR

Retrieval-Augmented LLMs for Security Incident Analysis

/ May 6, 2026

arXiv:2603.18196v3 Announce Type: replace-cross
Abstract: Investigating cybersecurity incidents requires collecting and analyzing evidence from multiple log sources, including intrusion detection alerts, network traffic records, and authentication eve…

cs.AI, cs.LG

LLM-ADAM: A Generalizable LLM Agent Framework for Pre-Print Anomaly Detection in Additive Manufacturing

/ May 6, 2026

arXiv:2605.03328v1 Announce Type: cross
Abstract: Additive manufacturing (AM) continues to transform modern manufacturing by enabling flexible, on-demand production of complex geometries across diverse industries. Fused filament fabrication (FFF) has …

cs.AI, cs.CR

Repurposing and Evaluating the (In)Feasibility of Dataset Poisoning enabled Watermarking for Contrastive Learning

/ May 6, 2026

arXiv:2605.01834v1 Announce Type: cross
Abstract: Contrastive learning (CL) reduces annotation cost via auto-derived supervisory signals. Since large-scale in-house CL datasets are infeasible, reliance on third-party or internet data is common. Recent…

cs.AI, eess.AS

PS-TTS: Phonetic Synchronization in Text-to-Speech for Achieving Natural Automated Dubbing

/ May 6, 2026

arXiv:2604.09111v4 Announce Type: replace-cross
Abstract: Recently, artificial intelligence-based dubbing technology has advanced, enabling automated dubbing (AD) to convert the source speech of a video into target speech in different languages. Howev…

cs.AI

MSEarth: A Multimodal Benchmark for Earth Science Phenomenon Discovery with MLLMs

/ May 6, 2026

arXiv:2505.20740v3 Announce Type: replace
Abstract: The rapid advancement of multimodal large language models (MLLMs) offers new opportunities for complex scientific challenges, yet their application in earth science-especially at the graduate level-r…