simmediumimitationmetric · varies

IN-RIL: Interleaved Reinforcement and Imitation Learning for Policy Fine-Tuning

Description

Imitation learning (IL) and reinforcement learning (RL) each offer distinct advantages for robotics policy learning: IL provides stable learning from demonstrations, and RL promotes generalization through exploration. While existing robot learning approaches using IL-based pre-training followed by RL-based fine-tuning are promising, this two-step learning paradigm often suffers from instability and poor sample efficiency during the RL fine-tuning phase. In this work, we introduce IN-RIL, INterle

Source

http://arxiv.org/abs/2505.10442v1