PAPO_ViRL39K_train

PAPOGalaxy

Overview

Name

PAPO_ViRL39K_train

Source

PAPOGalaxy

Episodes

Robot count

Format

parquet

Description

This is the official release of the training data for the paper "PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning".
Hugging Face paper: https://huggingface.co/papers/2507.06448
Project page: https://mikewangwzhl.github.io/PAPO/
This dataset is the train split of the PAPO training data. Optionally, to include a validation set, you may use our adapted val split, PAPOGalaxy/PAPO_MMK12_test.
Data Source (Training): We adapt the multimodal benchmark… See the full description on the dataset page: https://huggingface.co/datasets/PAPOGalaxy/PAPO_ViRL39K_train.

Robots used

null

Links

HuggingFace dataset

PAPOGalaxy/PAPO_ViRL39K_train
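
Since the record lists a parquet dataset hosted on Hugging Face, one plausible way to fetch it is the `datasets` library. The following is a minimal sketch, not the authors' documented workflow. It assumes `datasets` is installed and network access is available. The helper names `dataset_page_url`, `load_papo`, and `describe_splits` are hypothetical and introduced only for illustration. No split names are hard-coded, because this record does not state them.

```python
# Repository IDs taken from the record above.
TRAIN_REPO = "PAPOGalaxy/PAPO_ViRL39K_train"
VAL_REPO = "PAPOGalaxy/PAPO_MMK12_test"  # optional validation data per the description


def dataset_page_url(repo_id: str) -> str:
    """Build the dataset page URL in the same form the description uses."""
    return f"https://huggingface.co/datasets/{repo_id}"


def load_papo(repo_id: str = TRAIN_REPO):
    """Download the repository and return a DatasetDict keyed by split name.

    Hypothetical helper: the import is deferred so this module can be
    imported even when `datasets` is not installed.
    """
    from datasets import load_dataset

    # Without a `split` argument, load_dataset returns every available split.
    return load_dataset(repo_id)


def describe_splits(splits) -> None:
    """Print each split's name, row count, and column names."""
    for name, split in splits.items():
        print(name, split.num_rows, split.column_names)
```

Calling `describe_splits(load_papo())` would print whatever splits and columns the repository actually exposes. Passing `VAL_REPO` to `load_papo` would do the same for the optional validation data.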