sim · medium · atari · metric · varies

Hieros: Hierarchical Imagination on Structured State Space Sequence World Models

Description

One of the biggest challenges for modern deep reinforcement learning (DRL) algorithms is sample efficiency. Many approaches learn a world model in order to train an agent entirely in imagination, eliminating the need for direct environment interaction during training. However, these methods often lack imagination accuracy, exploration capabilities, or runtime efficiency. We propose Hieros, a hierarchical policy that learns time-abstracted world representations and imagines

Source

http://arxiv.org/abs/2310.05167v3