- Provide.ai - Page 147

Deep Learning for Model Calibration in Simulation of Itaconic Acid Production

/ April 27, 2026

arXiv:2604.22496v1 Announce Type: new
Abstract: In this study, deep learning is used to estimate kinetic parameters for modeling itaconic acid production based on real batch experiments conducted at different agitation speeds and reactor scales. Two d…

cs.CL

NiuTrans.LMT: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs

/ April 27, 2026

arXiv:2511.07003v2 Announce Type: replace
Abstract: Large language models have significantly advanced Multilingual Machine Translation (MMT), yet scaling to many languages while keeping quality robust across directions remains challenging. In this pap…

cs.CL, cs.LG

LayerBoost: Layer-Aware Attention Reduction for Efficient LLMs

/ April 27, 2026

arXiv:2604.22050v1 Announce Type: cross
Abstract: Transformers are mostly relying on softmax attention, which introduces quadratic complexity with respect to sequence length and remains a major bottleneck for efficient inference. Prior work on linear …

cs.CE, cs.CL, cs.LG, q-fin.CP, q-fin.PM

Predicting Liquidity-Aware Bond Yields using Causal GANs and Deep Reinforcement Learning with LLM Evaluation

/ April 27, 2026

arXiv:2502.17011v2 Announce Type: replace-cross
Abstract: Financial bond yield forecasting is challenging due to data scarcity, nonlinear macroeconomic dependencies, and evolving market conditions. In this paper, we propose a novel framework that leve…

cs.LG

When Quotes Crumble: Detecting Transient Mechanical Liquidity Erosion in Limit Order Books

/ April 27, 2026

arXiv:2604.21993v1 Announce Type: new
Abstract: We study the detection of transient liquidity erosion (“crumbling quotes”) in electronic limit order books, where observable quote deterioration may reflect either mechanical liquidity withdrawal or info…

cs.LG

PreMoE: Proactive Inference for Efficient Mixture-of-Experts

/ April 27, 2026

arXiv:2505.17639v3 Announce Type: replace
Abstract: Mixture-of-Experts (MoE) models offer dynamic computation, but are typically deployed as static full-capacity models, missing opportunities for deployment-specific specialization. We introduce PreMoE…

cs.LG

Artificial intelligence for methane detection: from continuous monitoring to verified mitigation

/ April 27, 2026

arXiv:2511.21777v3 Announce Type: replace
Abstract: Methane is a potent greenhouse gas, responsible for roughly 30% of warming since pre-industrial times. A small number of large point sources account for a disproportionate share of emissions, creatin…

cs.CV, cs.LG, eess.IV

Useful nonrobust features are ubiquitous in biomedical images

/ April 27, 2026

arXiv:2604.22579v1 Announce Type: cross
Abstract: We study whether deep networks for medical imaging learn useful nonrobust features – predictive input patterns that are not human interpretable and highly susceptible to small adversarial perturbations…

cs.LG

Toward Robust and Efficient ML-Based GPU Caching for Modern Inference

/ April 27, 2026

arXiv:2509.20979v2 Announce Type: replace
Abstract: In modern GPU inference, cache efficiency remains a major bottleneck, and heuristic policies such as \textsc{LRU} can perform far worse than the offline optimum. Existing learning-based caching syste…

cs.LG

Leveraging Teleconnections with Physics-Informed Graph Attention Networks for Long-Range Extreme Rainfall Forecasting in Thailand

/ April 27, 2026

arXiv:2510.12328v5 Announce Type: replace
Abstract: Accurate rainfall forecasting, particularly for extreme events, remains a significant challenge in climatology and the Earth system. This paper presents novel physics-informed Graph Neural Networks (…