Provide.ai - We Provide AI To Companies

LaST$_{0}$: Latent Spatio-Temporal Chain-of-Thought for Robotic Vision-Language-Action Model

/ March 31, 2026

arXiv:2601.05248v3 Announce Type: replace
Abstract: Vision-Language-Action (VLA) models have recently shown strong generalization, with some approaches seeking to explicitly generate linguistic reasoning traces or predict future observations prior to …

cs.RO

Active Stereo-Camera Outperforms Multi-Sensor Setup in ACT Imitation Learning for Humanoid Manipulation

Robin K\"uhn, Moritz Schappler, Thomas Seel, Dennis Bank / March 31, 2026

arXiv:2603.28422v1 Announce Type: new
Abstract: The complexity of teaching humanoid robots new tasks is one of the major reasons hindering their widespread adoption in the industry. While Imitation Learning (IL), particularly Action Chunking with Tran…

cs.RO

CycleManip: Enabling Cyclic Task Manipulation via Effective Historical Perception and Understanding

Yi-Lin Wei, Haoran Liao, Yuhao Lin, Pengyue Wang, Zhizhao Liang, Guiliang Liu, Wei-Shi Zheng / March 31, 2026

arXiv:2512.01022v2 Announce Type: replace
Abstract: In this paper, we explore an important yet underexplored task in robot manipulation: cycle-based manipulation, where robots need to perform cyclic or repetitive actions with an expected terminal time…

cond-mat.mtrl-sci, cond-mat.soft, cs.RO, physics.app-ph

A Foldable and Agile Soft Electromagnetic Robot for Multimodal Navigation in Confined and Unstructured Environments

/ March 31, 2026

arXiv:2603.28362v1 Announce Type: new
Abstract: Multimodal locomotion is crucial for an animal’s adaptability in unstructured wild environments. Similarly, in the human gastrointestinal tract, characterized by viscoelastic mucus, complex rugae, and na…

cs.AI, cs.CL, cs.CV, cs.LG, cs.RO

ViPRA: Video Prediction for Robot Actions

Sandeep Routray, Hengkai Pan, Unnat Jain, Shikhar Bahl, Deepak Pathak / March 31, 2026

arXiv:2511.07732v2 Announce Type: replace
Abstract: Can we turn a video prediction model into a robot policy? Videos, including those of humans or teleoperated robots, capture rich physical interactions. However, most of them lack labeled actions, whi…

cs.HC, cs.RO

Proposing a Game Theory Approach to Explore Group Dynamics with Social Robot

Giulia Pusceddu / March 31, 2026

arXiv:2603.28348v1 Announce Type: new
Abstract: Integrating social robots in our group-based society, beyond the technical challenges, requires considering the social group dynamics. Following the results from preliminary exploratory studies on the in…

cs.RO

EgoDemoGen: Egocentric Demonstration Generation for Viewpoint Generalization in Robotic Manipulation

/ March 31, 2026

arXiv:2509.22578v2 Announce Type: replace
Abstract: Imitation learning based visuomotor policies have achieved strong performance in robotic manipulation, yet they often remain sensitive to egocentric viewpoint shifts. Unlike third-person viewpoint ch…

cs.RO

Point of View: How Perspective Affects Perceived Robot Sociability

Subham Agrawal, Aftab Akthar, Nils Dengler, Maren Bennewitz / March 31, 2026

arXiv:2603.28272v1 Announce Type: new
Abstract: Ensuring that robot navigation is safe and socially acceptable is crucial for comfortable human-robot interaction in shared environments. However, existing validation methods often rely on a bird’s-eye (…

cs.RO

Omni-LIVO: Robust RGB-Colored Multi-Camera Visual-Inertial-LiDAR Odometry via Photometric Migration and ESIKF Fusion

/ March 31, 2026

arXiv:2509.15673v5 Announce Type: replace
Abstract: Wide field-of-view (FoV) LiDAR sensors provide dense geometry across large environments, but existing LiDAR-inertial-visual odometry (LIVO) systems generally rely on a single camera, limiting their a…

cs.RO

osmAG-Nav: A Hierarchical Semantic Topometric Navigation Stack for Robust Lifelong Indoor Autonomy

Yongqi Zhang, Jiajie Zhang, Chengqian Li, Fujing Xie, S\"oren Schwertfeger / March 31, 2026

arXiv:2603.28271v1 Announce Type: new
Abstract: The deployment of mobile robots in large-scale, multi-floor environments demands navigation systems that achieve spatial scalability without compromising local kinematic precision. Traditional navigation…