- Provide.ai - Page 131

Instance Awareness of Multi-class Semantic Segmentation Loss Functions

/ April 28, 2026

arXiv:2604.24276v1 Announce Type: new
Abstract: Instance-sensitive losses for semantic segmentation such as blob loss and CC loss were designed to address instance imbalance, ensuring small lesions generate the same gradient as large ones, but operate…

cs.AI

Transferable Human Mobility Network Reconstruction with neuroGravity

/ April 28, 2026

arXiv:2604.23678v1 Announce Type: new
Abstract: Accurate modeling of human mobility is critical for tackling urban planning and public health challenges. In undeveloped regions, the absence of comprehensive travel surveys necessitates reconstructing m…

cs.LG, q-bio.NC

Integrative neurocybernetic modeling in the era of large-scale neuroscience

/ April 28, 2026

arXiv:2604.23903v1 Announce Type: cross
Abstract: Large-scale neuroscience is generating rich datasets across animals, brain areas and behavioral contexts, yet our modeling efforts remains fragmented across isolated experiments. We argue that understa…

cs.LG

Estimating Dense-Packed Zone Height in Liquid-Liquid Separation: A Physics-Informed Neural Network Approach

/ April 28, 2026

arXiv:2601.18399v2 Announce Type: replace
Abstract: Separating liquid-liquid dispersions in gravity settlers is critical in chemical, pharmaceutical, and recycling processes. The dense-packed zone height is an important performance and safety indicato…

Artificial Intelligence, Microsoft, Vendors and Providers

Microsoft, OpenAI change contract terms — again

/ April 28, 2026

Microsoft and OpenAI on Monday again revised their agreement, softening their exclusivity and revenue-sharing conditions in the process. These changes underscore how critical it is for enterprises to work with as many AI vendors …

cs.CL

Aggregate vs. Personalized Judges in Business Idea Evaluation: Evidence from Expert Disagreement

/ April 27, 2026

arXiv:2604.22517v1 Announce Type: new
Abstract: Evaluating LLM-generated business ideas is often harder to scale than generating them. Unlike standard NLP benchmarks, business idea evaluation relies on multi-dimensional criteria such as feasibility, n…

cs.CL

RouteLMT: Learned Sample Routing for Hybrid LLM Translation Deployment

/ April 27, 2026

arXiv:2604.22520v1 Announce Type: new
Abstract: Large Language Models (LLMs) have achieved remarkable performance in Machine Translation (MT), but deploying them at scale remains prohibitively expensive. A widely adopted remedy is the hybrid system pa…

cs.CL, cs.LG

Multi-Token Prediction via Self-Distillation

/ April 27, 2026

arXiv:2602.06019v2 Announce Type: replace
Abstract: Existing techniques for accelerating language model inference, such as speculative decoding, require training auxiliary speculator models and building and deploying complex inference pipelines. We co…

cs.AI, cs.CL, cs.CY, cs.HC, cs.SE

How Do AI Agents Spend Your Money? Analyzing and Predicting Token Consumption in Agentic Coding Tasks

/ April 27, 2026

arXiv:2604.22750v1 Announce Type: cross
Abstract: The wide adoption of AI agents in complex human workflows is driving rapid growth in LLM token consumption. When agents are deployed on tasks that require a significant amount of tokens, three question…

cs.CL

Why Supervised Fine-Tuning Fails to Learn: A Systematic Study of Incomplete Learning in Large Language Models

/ April 27, 2026

arXiv:2604.10079v4 Announce Type: replace
Abstract: Supervised Fine-Tuning (SFT) is the standard approach for adapting large language models (LLMs) to downstream tasks. However, we observe a persistent failure mode: even after convergence, models ofte…