dataset
pack_toothbrush_Nov26-advantages
villekuosmanen
Overview
Name
pack_toothbrush_Nov26-advantages
Source
villekuosmanen
Episodes
0
Robot count
0
Format
parquet
Description
Advantage Values for villekuosmanen/pack_toothbrush_Nov26
Pre-computed advantage values for offline RL training.
Source
Dataset: villekuosmanen/pack_toothbrush_Nov26
Value Model: villekuosmanen/rewact_toothbrush_pistar_1.5.0
N-step lookahead: 50
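The listing does not say how the advantages were computed from the value model. As a rough illustration only, one common way to define an N-step advantage is the discounted sum of the next N rewards, plus the value estimate at the end of the window, minus the value estimate at the current frame. The sketch below follows that definition. The function name, the reward signal, and the discount factor `gamma` are assumptions made for illustration, not details taken from this dataset.

```python
def n_step_advantages(rewards, values, n=50, gamma=1.0):
    """Illustrative N-step advantage estimate for a single episode.

    rewards[t] and values[t] are per-frame rewards and value estimates.
    The reward definition and gamma are assumptions, not dataset facts.
    """
    if len(rewards) != len(values):
        raise ValueError("rewards and values must have the same length")

    num_frames = len(rewards)
    advantages = []
    for t in range(num_frames):
        # The lookahead window is cut short near the end of the episode.
        end = min(t + n, num_frames)
        n_step_return = sum(gamma ** k * rewards[t + k] for k in range(end - t))
        # Bootstrap from the value estimate only if the window ends
        # before the episode does.
        if end < num_frames:
            n_step_return += gamma ** (end - t) * values[end]
        advantages.append(n_step_return - values[t])
    return advantages
```

With N = 50, frames in the last 50 steps of an episode would use only the rewards actually observed, with no bootstrapped value term, under this definition.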
Files
This dataset contains per-episode parquet files with advantage values for each frame.
Usage
from pathlib import Path
import pandas as pd

# Load advantages for a specific episode
advantage_df = …

See the full description on the dataset page: https://huggingface.co/datasets/villekuosmanen/pack_toothbrush_Nov26-advantages
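The usage snippet is truncated in this listing, so the actual file layout is not shown here. The following is a minimal sketch of reading one per-episode parquet file with pandas, assuming the dataset has already been downloaded to a local directory. The `episode_{index:06d}.parquet` naming scheme and both helper names are hypothetical; check the dataset page for the real file names and columns.

```python
from pathlib import Path

import pandas as pd


def episode_file(root, episode_index):
    # Hypothetical naming scheme; the real file names may differ.
    return Path(root) / f"episode_{episode_index:06d}.parquet"


def load_episode_advantages(root, episode_index):
    """Read the advantage table for one episode into a DataFrame."""
    path = episode_file(root, episode_index)
    if not path.exists():
        raise FileNotFoundError(f"No advantage file found at {path}")
    # pandas needs a parquet engine such as pyarrow to read this file.
    return pd.read_parquet(path)
```

Since the files hold one advantage value per frame, the returned DataFrame would be expected to have one row per frame of the episode.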
Robots used
null
Links
HuggingFace dataset