WildFeedback: Aligning LLMs With In-situ User Interactions And Feedback
arXiv:2408.15549v4 Announce Type: replace
Abstract: As large language models (LLMs) continue to advance, aligning these models with human preferences has emerged as a critical challenge. Traditional alignment methods, relying on human- or LLM-annotated…