← Back to Benchmarks
simmediumatarimetric · varies

Augmenting Policy Learning with Routines Discovered from a Single Demonstration

Description

Humans can abstract prior knowledge from very little data and use it to boost skill learning. In this paper, we propose routine-augmented policy learning (RAPL), which discovers routines composed of primitive actions from a single demonstration and uses discovered routines to augment policy learning. To discover routines from the demonstration, we first abstract routine candidates by identifying grammar over the demonstrated action trajectory. Then, the best routines measured by length and frequ

Source

http://arxiv.org/abs/2012.12469v4