← Back to Benchmarks
simmediumimitationmetric · varies

ManiFlow: A General Robot Manipulation Policy via Consistency Flow Training

Description

This paper introduces ManiFlow, a visuomotor imitation learning policy for general robot manipulation that generates precise, high-dimensional actions conditioned on diverse visual, language and proprioceptive inputs. We leverage flow matching with consistency training to enable high-quality dexterous action generation in just 1-2 inference steps. To handle diverse input modalities efficiently, we propose DiT-X, a diffusion transformer architecture with adaptive cross-attention and AdaLN-Zero co

Source

http://arxiv.org/abs/2509.01819v1