dataset
MMPR-Tiny
OpenGVLab
Overview
Name
MMPR-Tiny
Source
OpenGVLab
Episodes
0
Robot count
0
Format
other
Description
MMPR-Tiny
This is the training data used during the online RL stage of InternVL3.5. That stage greatly improves the overall performance of InternVL3.5 across all model scales. Our training code is also open-sourced.
Starting from MMPR-v1.2, we compute the accuracy of each query using the provided rollouts and keep only the queries whose accuracy falls between 0.2 and 0.8 for online RL.
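The filtering step can be illustrated with a minimal sketch. This is not the authors' released code: the record layout (a `rollout_correct` list of booleans per query) and the function names are hypothetical, and treating the 0.2 and 0.8 bounds as inclusive is an assumption, since the description does not say.

```python
from typing import Any, Dict, Iterable, List


def query_accuracy(rollout_correct: Iterable[bool]) -> float:
    """Fraction of rollouts for one query that were judged correct."""
    flags = list(rollout_correct)
    if not flags:
        raise ValueError("query has no rollouts")
    return sum(flags) / len(flags)


def select_for_online_rl(
    queries: Iterable[Dict[str, Any]],
    low: float = 0.2,
    high: float = 0.8,
) -> List[Dict[str, Any]]:
    """Keep queries whose rollout accuracy lies in [low, high].

    Assumption: each query is a dict with a hypothetical
    "rollout_correct" field, and both bounds are inclusive.
    """
    selected = []
    for query in queries:
        accuracy = query_accuracy(query["rollout_correct"])
        if low <= accuracy <= high:
            # Store the computed accuracy alongside the original fields.
            selected.append({**query, "accuracy": accuracy})
    return selected


# Toy data: one query the model always solves, one it never solves,
# and one it solves half the time.
example_queries = [
    {"id": "always_right", "rollout_correct": [True, True, True, True]},
    {"id": "always_wrong", "rollout_correct": [False, False, False, False]},
    {"id": "mixed", "rollout_correct": [True, False, True, False]},
]
kept = select_for_online_rl(example_queries)
kept_ids = [q["id"] for q in kept]
```

On this toy input only the `mixed` query survives. Queries the model always or never answers correctly are dropped by the accuracy band.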
We further extend the dataset with recent multimodal datasets to enhance diversity.
Please refer to our paper for… See the full description on the dataset page: https://huggingface.co/datasets/OpenGVLab/MMPR-Tiny.
Robots used
null
Links
HuggingFace dataset