← Back to Benchmarks
simmediummanipulationmetric · varies

TGM-VLA: Task-Guided Mixup for Sampling-Efficient and Robust Robotic Manipulation

Description

The performance of robotic imitation learning is fundamentally limited by data quality and training strategies. Prevalent sampling strategies on RLBench suffer from severe keyframe redundancy and imbalanced temporal distribution, leading to inefficient memory usage and unstable optimization. Moreover, reprojecting point clouds onto multi-view images with a black background--while more efficient than voxel-based methods--often causes dark objects to be indistinguishable and hard to manipulate. In

Source

http://arxiv.org/abs/2603.00615v1