Provide.ai - Page 29

Mitigating Label Shift in Tabular In-Context Learning via Test-Time Posterior Adjustment

/ May 7, 2026

arXiv:2605.04363v1 Announce Type: new
Abstract: TabPFN has recently gained attention as a foundation model for tabular datasets, achieving strong performance by leveraging in-context learning on synthetic data. However, we find that TabPFN is vulnerab…

cs.AI

VCBench: Benchmarking LLMs in Venture Capital

/ May 7, 2026

arXiv:2509.14448v2 Announce Type: replace
Abstract: Benchmarks such as SWE-bench and ARC-AGI demonstrate how shared datasets accelerate progress toward artificial general intelligence (AGI). We introduce VCBench, the first benchmark for predicting fou…

cs.AI, cs.LG, stat.ML

When LLMs get significantly worse: A statistical approach to detect model degradations

/ May 7, 2026

arXiv:2602.10144v2 Announce Type: replace
Abstract: Minimizing the inference cost and latency of foundation models has become a crucial area of research. Optimization approaches include theoretically lossless methods and others without accuracy guaran…

cs.CL, cs.LG

Conceptors for Semantic Steering

/ May 7, 2026

arXiv:2605.04980v1 Announce Type: new
Abstract: Activation-based steering provides control of LLM behavior at inference time, but the dominant paradigm reduces each concept to a single direction whose geometry is left largely unexamined. Rather than s…

cs.AI, cs.DC, cs.LG, cs.NE

Online Continual Learning on Intel Loihi 2 via a Co-designed Spiking Neural Network

/ May 7, 2026

arXiv:2511.01553v2 Announce Type: replace
Abstract: AI systems on edge devices require online continual learning — adapting to non-stationary streams and unfamiliar classes without catastrophic forgetting — under strict power constraints. We present…

cs.LG

Quant VideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization

/ May 7, 2026

arXiv:2602.02958v5 Announce Type: replace
Abstract: Despite rapid progress in autoregressive video diffusion, an emerging system algorithm bottleneck limits both deployability and generation capability: KV cache memory. In autoregressive video generat…

cs.AI, cs.LG

Federated Learning for Early Prediction of EV Charging Demand

/ May 7, 2026

arXiv:2605.04993v1 Announce Type: new
Abstract: Accurate forecasting of electric vehicle (EV) charging demand is critical for grid stability, infrastructure planning, and real-time charging optimization. In this work, we study the problem of early pre…

cs.LG, cs.SY, eess.SY

How Does the Lagrangian Guide Safe Reinforcement Learning through Diffusion Models?

/ May 7, 2026

arXiv:2602.02924v2 Announce Type: replace
Abstract: Diffusion policy sampling enables reinforcement learning (RL) to represent multimodal action distributions beyond suboptimal unimodal Gaussian policies. However, existing diffusion-based RL methods p…

cs.CL, cs.LG

DFPO: Scaling Value Modeling via Distributional Flow towards Robust and Generalizable LLM Post-Training

/ May 7, 2026

arXiv:2602.05890v2 Announce Type: replace
Abstract: Training reinforcement learning (RL) systems in real-world environments remains challenging due to noisy supervision and poor out-of-domain (OOD) generalization, especially in LLM post-training. Rece…

cs.LG

CuBridge: An LLM-Based Framework for Understanding and Reconstructing High-Performance Attention Kernels

/ May 7, 2026

arXiv:2605.05023v1 Announce Type: new
Abstract: Efficient CUDA implementations of attention mechanisms are critical to modern deep learning systems, yet supporting diverse and evolving attention variants remains challenging. Existing frameworks and co…