dataset

HPO-RL-Tiny

Nicknam

or hover any field below to flag it

Overview

Name

HPO-RL-Tiny

Source

Nicknam

Episodes

Robot count

Format

other

Description

HPO-RL-Bench Minimal Subset (100 configs per pair of algorithm and environment) ⚠️ This is a derivative subset of the official HPO-RL-Bench benchmark. It contains 100 randomly sampled hyperparameter configurations for each of the three core algorithms (DQN, PPO, SAC) to enable rapid prototyping and testing of hyperparameter optimization methods. For the full benchmark , see the original dataset. 📌 Attribution & License This dataset is a derivative work of:… See the full description on the dataset page: https://huggingface.co/datasets/Nicknam/HPO-RL-Tiny.

Robots used

null

Links

HuggingFace dataset

Nicknam/HPO-RL-Tiny