← Back to Benchmarks
simmediummanipulation-datametric · varies

LoLA: Long Horizon Latent Action Learning for General Robot Manipulation

Description

The capability of performing long-horizon, language-guided robotic manipulation tasks critically relies on leveraging historical information and generating coherent action sequences. However, such capabilities are often overlooked by existing Vision-Language-Action (VLA) models. To solve this challenge, we propose LoLA (Long Horizon Latent Action Learning), a framework designed for robot manipulation that integrates long-term multi-view observations and robot proprioception to enable multi-step

Source

http://arxiv.org/abs/2512.20166v1