← Back to Benchmarks
simmediumroboticsmetric · varies

ProgressVLA: Progress-Guided Diffusion Policy for Vision-Language Robotic Manipulation

Description

Most existing vision-language-action (VLA) models for robotic manipulation lack progress awareness, typically relying on hand-crafted heuristics for task termination. This limitation is particularly severe in long-horizon tasks involving cascaded sub-goals. In this work, we investigate the estimation and integration of task progress, proposing a novel model named {\textbf \vla}. Our technical contributions are twofold: (1) \emph{robust progress estimation}: We pre-train a progress estimator on l

Source

http://arxiv.org/abs/2603.27670v1