- Provide.ai - Page 4

Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning

/ May 6, 2026

arXiv:2605.02913v1 Announce Type: new
Abstract: Reinforcement learning (RL) has become a central post-training tool for improving the reasoning abilities of large language models (LLMs). In these systems, the rollout, the trajectory sampled from a pro…

cs.RO

A Certifably Correct Algorithm for Generalized Robot-World and Hand-Eye Calibration

/ May 6, 2026

arXiv:2507.23045v2 Announce Type: replace
Abstract: Automatic extrinsic sensor calibration is a fundamental problem for multi-sensor platforms. Reliable and general-purpose solutions should be computationally efficient, require few assumptions about t…

cs.LG

An End-to-End Framework for Building Large Language Models for Software Operations

/ May 6, 2026

arXiv:2605.02906v1 Announce Type: new
Abstract: In the field of software operations, Large Language Models (LLMs) have attracted increasing attention. However, existing research has not yet achieved efficient and effective end-to-end intelligent opera…

cs.AI, cs.LG

PRISM-CTG: A Foundation Model for Cardiotocography Analysis with Multi-View SSL

/ May 6, 2026

arXiv:2605.02917v1 Announce Type: new
Abstract: Supervised deep learning models for automated CTG analysis are typically constrained by narrowly curated labelled datasets and limited patient cohorts, leaving substantial volumes of physiologically info…

cs.CV, cs.LG, cs.RO, cs.SY, eess.SY

A Vision-Based Shared-Control Teleoperation Scheme for Controlling the Robotic Arm of a Four-Legged Robot

/ May 6, 2026

arXiv:2508.14994v3 Announce Type: replace-cross
Abstract: In hazardous and remote environments, robotic systems perform critical tasks demanding improved safety and efficiency. Among these, quadruped robots with manipulator arms offer mobility and ver…

cs.LG, cs.RO, cs.SY, eess.SY

Viewpoint-Agnostic Grasp Pipeline using VLM and Partial Observations

/ May 6, 2026

arXiv:2603.07866v2 Announce Type: replace
Abstract: Robust grasping in cluttered, unstructured environments remains challenging for mobile legged manipulators due to occlusions that lead to partial observations, unreliable depth estimates, and the nee…

cs.AI, cs.LG

Mitigating the reconstruction-detection trade-off in VAE-based unsupervised anomaly detection

/ May 6, 2026

arXiv:2605.02918v1 Announce Type: new
Abstract: Variational autoencoders are widely used for unsupervised anomaly detection. Model selection however remains an open-question: to remain fully unsupervised, hyperparameters are often chosen to minimize t…

cs.RO

OmniUMI: Towards Physically Grounded Robot Learning via Human-Aligned Multimodal Interaction

/ May 6, 2026

arXiv:2604.10647v3 Announce Type: replace
Abstract: UMI-style interfaces enable scalable robot learning, but existing systems remain largely visuomotor, relying primarily on RGB observations and trajectory while providing only limited access to physic…

cs.RO

Height Control and Optimal Torque Planning for Jumping With Wheeled-Bipedal Robots

/ May 6, 2026

arXiv:2605.03302v1 Announce Type: new
Abstract: This paper mainly studies the accurate height jumping control of wheeled-bipedal robots based on torque planning and energy consumption optimization. Due to the characteristics of underactuated, nonlinea…

cond-mat.stat-mech, cs.LG, q-bio.QM

Expanding functional protein sequence space using high entropy generative models

/ May 6, 2026

arXiv:2605.03578v1 Announce Type: cross
Abstract: Boltzmann Machines trained on evolutionary sequence data have emerged as a powerful paradigm for the data-driven design of artificial proteins. However, the relationship between model architecture, spe…