← Back to Benchmarks
simmediummanipulation-datametric · varies

OCRA: Object-Centric Learning with 3D and Tactile Priors for Human-to-Robot Action Transfer

Description

We present OCRA, an Object-Centric framework for video-based human-to-Robot Action transfer that learns directly from human demonstration videos to enable robust manipulation. Object-centric learning emphasizes task-relevant objects and their interactions while filtering out irrelevant background, providing a natural and scalable way to teach robots. OCRA leverages multi-view RGB videos, the state-of-the-art 3D foundation model VGGT, and advanced detection and segmentation models to reconstruct

Source

http://arxiv.org/abs/2603.14401v1