dataset

VPPO_ViRL39K_train

chamber111

Overview

Name
VPPO_ViRL39K_train
Source
chamber111
Episodes
0
Robot count
0
Format
parquet
Description
This dataset is the official training split used to fine-tune the VPPO-7B and VPPO-32B models presented in our paper, "Spotlight on Token Perception for Multimodal Reinforcement Learning". It is a direct copy of the TIGER-Lab/ViRL39K dataset. We have isolated it here to ensure the exact version used in our experiments is publicly available, guaranteeing reproducibility for our research.… See the full description on the dataset page: https://huggingface.co/datasets/chamber111/VPPO_ViRL39K_train.
Robots used
null
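
Since the overview lists the format as parquet and points to a Hugging Face dataset page, one plausible way to fetch the data is through the `datasets` library. The following is a minimal sketch, not an official loading script: the helper names `dataset_url` and `load_splits` are hypothetical, it assumes the `datasets` package is installed and network access is available, and it makes no assumption about split or column names.

```python
"""Sketch: fetch the dataset named in the overview from the Hugging Face Hub."""

# Repository id taken from the dataset page URL in the Description field.
REPO_ID = "chamber111/VPPO_ViRL39K_train"


def dataset_url(repo_id: str) -> str:
    """Build the Hugging Face dataset page URL for a repository id."""
    return f"https://huggingface.co/datasets/{repo_id}"


def load_splits(repo_id: str = REPO_ID):
    """Download the dataset and return a mapping of split name to Dataset.

    Hypothetical helper: assumes the `datasets` package is installed and
    the Hub is reachable. No split is requested, so every available split
    is returned and none has to be guessed.
    """
    # Imported lazily so this module can be imported without `datasets`.
    from datasets import load_dataset

    return load_dataset(repo_id)
```

Calling `load_splits()` should return a dictionary-like object keyed by split name; iterating over its items and reading `num_rows` and `column_names` on each split is one way to check what the copy actually contains before using it.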

Links