← Back to Benchmarks
simmediumatarimetric · varies

Improving Token-Based World Models with Parallel Observation Prediction

Description

Motivated by the success of Transformers when applied to sequences of discrete symbols, token-based world models (TBWMs) were recently proposed as sample-efficient methods. In TBWMs, the world model consumes agent experience as a language-like sequence of tokens, where each observation constitutes a sub-sequence. However, during imagination, the sequential token-by-token generation of next observations results in a severe bottleneck, leading to long training times, poor GPU utilization, and limi

Source

http://arxiv.org/abs/2402.05643v5