← Back to Benchmarks
simmediumimitationmetric · varies

LUMOS: Language-Conditioned Imitation Learning with World Models

Description

We introduce LUMOS, a language-conditioned multi-task imitation learning framework for robotics. LUMOS learns skills by practicing them over many long-horizon rollouts in the latent space of a learned world model and transfers these skills zero-shot to a real robot. By learning on-policy in the latent space of the learned world model, our algorithm mitigates policy-induced distribution shift which most offline imitation learning methods suffer from. LUMOS learns from unstructured play data with

Source

http://arxiv.org/abs/2503.10370v1