sim · medium · atari · metric · varies

ToyBox: Better Atari Environments for Testing Reinforcement Learning Agents

Description

It is a widely accepted principle that software without tests has bugs. Testing reinforcement learning agents is especially difficult because of the stochastic nature of both agents and environments, the complexity of state-of-the-art models, and the sequential nature of their predictions. Recently, the Arcade Learning Environment (ALE) has become one of the most widely used benchmark suites for deep learning research, and state-of-the-art Reinforcement Learning (RL) agents have been shown to ro

Source

http://arxiv.org/abs/1812.02850v3