dataset
HPO-RL-Tiny
Nicknam
Overview
Name
HPO-RL-Tiny
Source
Nicknam
Episodes
0
Robot count
0
Format
other
Description
HPO-RL-Bench Minimal Subset (100 configs per pair of algorithm and environment)
⚠️ This is a derivative subset of the official HPO-RL-Bench benchmark. It contains 100 randomly sampled hyperparameter configurations for each of the three core algorithms (DQN, PPO, SAC) to enable rapid prototyping and testing of hyperparameter optimization methods.
For the full benchmark, see the original dataset.
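The description says the subset keeps 100 randomly sampled configurations per algorithm and environment pair. A minimal sketch of that kind of per-group subsampling is shown here, using toy data. This is not the dataset author's actual procedure: the field names (`algorithm`, `environment`, `config_id`), the environment names, the seed, and the helper `sample_per_pair` are all hypothetical and chosen only for illustration.

```python
import random
from collections import defaultdict


def sample_per_pair(records, n=100, seed=0):
    """Pick up to n records at random for each (algorithm, environment) pair.

    Hypothetical helper: the record keys used here are assumptions,
    not the real HPO-RL-Tiny schema.
    """
    rng = random.Random(seed)  # seeded so the subset is reproducible

    # Group records by their (algorithm, environment) pair.
    groups = defaultdict(list)
    for rec in records:
        groups[(rec["algorithm"], rec["environment"])].append(rec)

    # Sample without replacement inside each group, in a fixed pair order.
    subset = {}
    for pair in sorted(groups):
        configs = groups[pair]
        subset[pair] = rng.sample(configs, min(n, len(configs)))
    return subset


# Toy stand-in for a full benchmark: 250 configs per pair (made-up data).
records = [
    {"algorithm": algo, "environment": env, "config_id": i}
    for algo in ("DQN", "PPO", "SAC")
    for env in ("env_a", "env_b")
    for i in range(250)
]

subset = sample_per_pair(records)
print({pair: len(configs) for pair, configs in subset.items()})
```

Sampling inside each group, rather than from the pooled records, guarantees every pair keeps the same number of configurations, which is what a "100 configs per pair" subset implies.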
📌 Attribution & License
This dataset is a derivative work of:… See the full description on the dataset page: https://huggingface.co/datasets/Nicknam/HPO-RL-Tiny.
Robots used
null
Links
Hugging Face dataset: https://huggingface.co/datasets/Nicknam/HPO-RL-Tiny