simmediumatarimetric · varies

StARformer: Transformer with State-Action-Reward Representations for Visual Reinforcement Learning

Description

Reinforcement Learning (RL) can be considered as a sequence modeling task: given a sequence of past state-action-reward experiences, an agent predicts a sequence of next actions. In this work, we propose State-Action-Reward Transformer (StARformer) for visual RL, which explicitly models short-term state-action-reward representations (StAR-representations), essentially introducing a Markovian-like inductive bias to improve long-term modeling. Our approach first extracts StAR-representations by se

Source

http://arxiv.org/abs/2110.06206v3