Tokens-per-Parameter Coverage Is Critical for Robust LLM Scaling Law Extrapolation
arXiv:2605.08541v1 Announce Type: new
Abstract: Neural scaling laws approximate a language model’s loss as a power-law function of parameter count $N$ and token count $D$. Following Chinchilla-style compute-optimal training, many studies fit scaling l…
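To make the abstract's setup concrete, below is a minimal sketch (not the paper's code) of fitting the standard Chinchilla-style parametric loss surface $L(N, D) = E + A/N^{\alpha} + B/D^{\beta}$ to a handful of training runs spanning different tokens-per-parameter ratios $D/N$. The synthetic data, parameter values, and use of `scipy.optimize.curve_fit` are illustrative assumptions; the paper's actual fitting procedure and datasets are not reproduced here.

```python
# Minimal sketch: fit L(N, D) = E + A / N^alpha + B / D^beta to synthetic runs.
# All constants below are illustrative assumptions, not values from the paper.
import numpy as np
from scipy.optimize import curve_fit

def chinchilla_loss(ND, E, A, alpha, B, beta):
    """Parametric loss: irreducible term plus power laws in N (params) and D (tokens)."""
    N, D = ND
    return E + A / N**alpha + B / D**beta

# Synthetic training runs covering a range of tokens-per-parameter ratios D/N.
rng = np.random.default_rng(0)
N = np.logspace(7, 9, 12)                                  # 10M .. 1B parameters
ratios = rng.uniform(10.0, 40.0, N.size)                   # D/N roughly around 20
D = N * ratios
true = dict(E=1.69, A=406.4, alpha=0.34, B=410.7, beta=0.28)   # illustrative only
loss = chinchilla_loss((N, D), **true) + rng.normal(0, 0.005, N.size)

# Fit the five parameters; bounds keep scales and exponents positive.
p0 = [1.5, 100.0, 0.3, 100.0, 0.3]
popt, _ = curve_fit(chinchilla_loss, (N, D), loss, p0=p0,
                    bounds=([0, 0, 0, 0, 0], [10, 1e4, 1, 1e4, 1]),
                    maxfev=20000)
E_hat, A_hat, alpha_hat, B_hat, beta_hat = popt
print(f"fitted: E={E_hat:.3f} A={A_hat:.1f} alpha={alpha_hat:.3f} "
      f"B={B_hat:.1f} beta={beta_hat:.3f}")
```

The fitted exponents are only trustworthy for extrapolation to the extent that the runs cover the tokens-per-parameter regime of interest, which is the coverage issue the title refers to.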