dataset

VPPO_ViRL39K_train

chamber111

Overview

Name

VPPO_ViRL39K_train

Source

chamber111

Episodes

Robot count

Format

parquet

Description

This dataset is the official training split used to fine-tune the VPPO-7B and VPPO-32B models presented in our paper, "Spotlight on Token Perception for Multimodal Reinforcement Learning". It is a direct copy of the TIGER-Lab/ViRL39K dataset, isolated here so that the exact version used in our experiments is publicly available and the research is reproducible.… See the full description on the dataset page: https://huggingface.co/datasets/chamber111/VPPO_ViRL39K_train.

Robots used

null

Links

HuggingFace dataset

chamber111/VPPO_ViRL39K_train
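
Loading example

A minimal sketch of how the repository listed above could be fetched with the Hugging Face `datasets` library. The helper names `dataset_page_url` and `load_training_data` are hypothetical and introduced only for illustration. The card does not state split names or column names, so the sketch requests no specific split and makes no assumptions about the schema.

```python
REPO_ID = "chamber111/VPPO_ViRL39K_train"  # repository ID taken from the Links field


def dataset_page_url(repo_id: str) -> str:
    """Build the Hugging Face dataset page URL for a repository ID."""
    return f"https://huggingface.co/datasets/{repo_id}"


def load_training_data(repo_id: str = REPO_ID):
    """Download the dataset from the Hub and return the splits it defines.

    Hypothetical helper. The import is done lazily so that this module can be
    imported even when the third-party `datasets` package is not installed.
    No split is requested because the card does not name one, so the call
    returns a mapping from split name to dataset.
    """
    from datasets import load_dataset  # third-party: pip install datasets

    return load_dataset(repo_id)
```

Calling `load_training_data()` requires network access and the `datasets` package. Printing the returned object lists the available splits and their columns, which is a reasonable first check before relying on any particular field.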