- Provide.ai - Page 55

Towards Generalizable Reasoning: Group Causal Counterfactual Policy Optimization for LLM Reasoning

/ May 14, 2026

arXiv:2602.06475v2 Announce Type: replace
Abstract: Large language models (LLMs) excel at complex tasks with advances in reasoning capabilities. However, existing reward mechanisms remain tightly coupled to final correctness and pay little attention t…

cs.AI, cs.LG, cs.SE

EGSS: Entropy-guided Stepwise Scaling for Reliable Software Engineering

/ May 14, 2026

arXiv:2602.05242v1 Announce Type: cross
Abstract: Agentic Test-Time Scaling (TTS) has delivered state-of-the-art (SOTA) performance on complex software engineering tasks such as code generation and bug fixing. However, its practical adoption remains l…

cs.CR, cs.CV, cs.HC

ThermalTap: Passive Application Fingerprinting in VR Headsets via Thermal Side Channels

/ May 14, 2026

arXiv:2605.12927v1 Announce Type: cross
Abstract: Standalone virtual reality (VR) headsets process highly sensitive personal, professional, and health-related data, yet their susceptibility to non-contact physical side channels remains largely unexplo…

cs.CV

SciVQR: A Multidisciplinary Multimodal Benchmark for Advanced Scientific Reasoning Evaluation

/ May 14, 2026

arXiv:2605.10187v2 Announce Type: replace
Abstract: Scientific reasoning is a key aspect of human intelligence, requiring the integration of multimodal inputs, domain expertise, and multi-step inference across various subjects. Existing benchmarks for…

astro-ph.EP, astro-ph.IM, cs.LG

Earth Science Foundation Models: From Perception to Reasoning and Discovery

/ May 14, 2026

arXiv:2605.12542v1 Announce Type: cross
Abstract: Large foundation models (FMs) are transforming Earth science by integrating heterogeneous multimodal data, such as multi-platform imagery, gridded reanalysis data, diverse geophysical and geochemical o…

cs.CV, cs.LG

Bias In, Bias Out? Finding Unbiased Subnetworks in Vanilla Models

/ May 14, 2026

arXiv:2603.05582v2 Announce Type: replace
Abstract: The issue of algorithmic biases in deep learning has led to the development of various debiasing techniques, many of which perform complex training procedures or dataset manipulation. However, an int…

cs.LG, physics.ao-ph

Data-Driven Integration Kernels for Interpretable Nonlocal Operator Learning

/ May 14, 2026

arXiv:2603.10305v3 Announce Type: replace
Abstract: Machine learning models can represent climate processes that are nonlocal in horizontal space, height, and time, often by combining information across these dimensions in highly nonlinear ways. While…

cs.LG

MIDST Challenge at SaTML 2025: Membership Inference over Diffusion-models-based Synthetic Tabular data

/ May 14, 2026

arXiv:2603.19185v2 Announce Type: replace
Abstract: Synthetic data is often perceived as a silver-bullet solution to data anonymization and privacy-preserving data publishing. Drawn from generative models like diffusion models, synthetic data is expec…

cs.CV

Pyramid Forcing: Head-Aware Pyramid KV Cache Policy for High-Quality Long Video Generation

/ May 14, 2026

arXiv:2605.13111v1 Announce Type: new
Abstract: Autoregressive video generation enables streaming and open-ended long video synthesis, but still suffers from long-term degradation caused by accumulated errors. Existing KVCache strategies usually apply…

cs.LG

Building Interactive Real-Time Agents with Asynchronous I/O and Speculative Tool Calling

/ May 14, 2026

arXiv:2605.13360v1 Announce Type: new
Abstract: There is a growing demand for agentic AI technologies for a range of downstream applications like customer service and personal assistants. For applications where the agent needs to interact with a perso…