dataset

HPO-RL-Tiny

Nicknam

or hover any field below to flag it

Overview

Name
HPO-RL-Tiny
Source
Nicknam
Episodes
0
Robot count
0
Format
other
Description
HPO-RL-Bench Minimal Subset (100 configs per pair of algorithm and environment) ⚠️ This is a derivative subset of the official HPO-RL-Bench benchmark. It contains 100 randomly sampled hyperparameter configurations for each of the three core algorithms (DQN, PPO, SAC) to enable rapid prototyping and testing of hyperparameter optimization methods. For the full benchmark , see the original dataset. 📌 Attribution & License This dataset is a derivative work of:… See the full description on the dataset page: https://huggingface.co/datasets/Nicknam/HPO-RL-Tiny.
Robots used
null

Links

HuggingFace dataset