- Provide.ai - Page 5

EVPO: Explained Variance Policy Optimization for Adaptive Critic Utilization in LLM Post-Training

/ April 22, 2026

arXiv:2604.19485v1 Announce Type: new
Abstract: Reinforcement learning (RL) for LLM post-training faces a fundamental design choice: whether to use a learned critic as a baseline for policy optimization. Classical theory favors critic-based methods su…

cs.AI, cs.DB, cs.LG

Revisiting RaBitQ and TurboQuant: A Symmetric Comparison of Methods, Theory, and Experiments

/ April 22, 2026

arXiv:2604.19528v1 Announce Type: new
Abstract: This technical note revisits the relationship between RaBitQ and TurboQuant under a unified comparison framework. We compare the two methods in terms of methodology, theoretical guarantees, and empirical…

cs.LG, cs.NE, q-bio.GN

An Imbalanced Dataset with Multiple Feature Representations for Studying Quality Control of Next-Generation Sequencing

/ April 22, 2026

arXiv:2604.04981v2 Announce Type: replace-cross
Abstract: Next-generation sequencing (NGS) is a key technique for studying the DNA and RNA of organisms. However, identifying quality problems in NGS data across different experimental settings remains c…

cs.CV, cs.LG

On the Generalizability of Foundation Models for Crop Type Mapping

/ April 22, 2026

arXiv:2409.09451v5 Announce Type: replace-cross
Abstract: Foundation models pre-trained using self-supervised learning have shown powerful transfer learning capabilities on various downstream tasks, including language understanding, text generation, a…

cs.AR, cs.LG

A PPA-Driven 3D-IC Partitioning Selection Framework with Surrogate Models

/ April 22, 2026

arXiv:2604.18806v1 Announce Type: new
Abstract: 3D-IC netlist partitioning is commonly optimized using proxy objectives, while final PPA is treated as a costly evaluation rather than an optimization signal. This proxy-driven paradigm makes it difficul…

cs.CV, cs.LG

Centralized Copy-Paste: Enhanced Data Augmentation Strategy for Wildland Fire Semantic Segmentation

/ April 22, 2026

arXiv:2507.06321v2 Announce Type: replace-cross
Abstract: Collecting and annotating images for the purpose of training segmentation models is often cost prohibitive. In the domain of wildland fire science, this challenge is further compounded by the s…

cs.LG, physics.comp-ph

The High Explosives and Affected Targets (HEAT) Dataset

/ April 22, 2026

arXiv:2604.18828v1 Announce Type: new
Abstract: Artificial Intelligence (AI) surrogate models provide a computationally efficient alternative to full-physics simulations, but no public datasets currently exist for training and validating models of hig…

cs.AI, cs.LG

One Step Forward and K Steps Back: Better Reasoning with Denoising Recursion Models

/ April 22, 2026

arXiv:2604.18839v1 Announce Type: new
Abstract: Looped transformers scale computational depth without increasing parameter count by repeatedly applying a shared transformer block and can be used for iterative refinement, where each loop rewrites a ful…

cs.LG, stat.ME

Collaborative Contextual Bayesian Optimization

/ April 22, 2026

arXiv:2604.18912v1 Announce Type: new
Abstract: Discovering optimal designs through sequential data collection is essential in many real-world applications. While Bayesian Optimization (BO) has achieved remarkable success in this setting, growing atte…

cs.AI, cs.LG, hep-ph, hep-th

Fine-Tuning Small Reasoning Models for Quantum Field Theory

/ April 22, 2026

arXiv:2604.18936v1 Announce Type: new
Abstract: Despite the growing application of Large Language Models (LLMs) to theoretical physics, there is little academic exploration into how domain-specific physics reasoning ability develops while training the…