dataset

PAPO_ViRL39K_train

PAPOGalaxy

or hover any field below to flag it

Overview

Name
PAPO_ViRL39K_train
Source
PAPOGalaxy
Episodes
0
Robot count
0
Format
parquet
Description
This is the official release of the training data for paper PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning. Hugging Face Paper: https://huggingface.co/papers/2507.06448 Project page: https://mikewangwzhl.github.io/PAPO/ This dataset is the train split of the training dataset for PAPO. (Optional) To include validate set, you may use our adapted val split PAPOGalaxy/PAPO_MMK12_test. Data Source Training We adapt the multimodal benchmark… See the full description on the dataset page: https://huggingface.co/datasets/PAPOGalaxy/PAPO_ViRL39K_train.
Robots used
null

Links