dataset
PAPO_ViRL39K_train
PAPOGalaxy
or hover any field below to flag it
Overview
Name
PAPO_ViRL39K_train
Source
PAPOGalaxy
Episodes
0
Robot count
0
Format
parquet
Description
This is the official release of the training data for paper PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning.
Hugging Face Paper: https://huggingface.co/papers/2507.06448
Project page: https://mikewangwzhl.github.io/PAPO/
This dataset is the train split of the training dataset for PAPO.
(Optional) To include validate set, you may use our adapted val split PAPOGalaxy/PAPO_MMK12_test.
Data Source
Training
We adapt the multimodal benchmark… See the full description on the dataset page: https://huggingface.co/datasets/PAPOGalaxy/PAPO_ViRL39K_train.
Robots used
null
Links
HuggingFace dataset