← Back to Benchmarks
simmediumatarimetric · varies

STORI: A Benchmark and Taxonomy for Stochastic Environments

Description

Reinforcement learning (RL) techniques have achieved impressive performance on simulated benchmarks such as Atari100k, yet recent advances remain largely confined to simulation and show limited transfer to real-world domains. A central obstacle is environmental stochasticity, as real systems involve noisy observations, unpredictable dynamics, and non-stationary conditions that undermine the stability of current methods. Existing benchmarks rarely capture these uncertainties and favor simplified

Source

http://arxiv.org/abs/2509.01793v2