← Back to Benchmarks
simmediummanipulation-datametric · varies
OCRA: Object-Centric Learning with 3D and Tactile Priors for Human-to-Robot Action Transfer
Description
We present OCRA, an Object-Centric framework for video-based human-to-Robot Action transfer that learns directly from human demonstration videos to enable robust manipulation. Object-centric learning emphasizes task-relevant objects and their interactions while filtering out irrelevant background, providing a natural and scalable way to teach robots. OCRA leverages multi-view RGB videos, the state-of-the-art 3D foundation model VGGT, and advanced detection and segmentation models to reconstruct