dataset

MMPR-Tiny

OpenGVLab


Overview

Name
MMPR-Tiny
Source
OpenGVLab
Episodes
0
Robot count
0
Format
other
Description
MMPR-Tiny is the training data used during the online RL stage of InternVL3.5, which greatly improves the overall performance of InternVL3.5 across all scales. Our training code is also open-sourced. Based on MMPR-v1.2, we compute the accuracy of each query using the provided rollouts and select the queries whose model accuracy falls between 0.2 and 0.8 for online RL. We further extend the dataset with recent multimodal datasets to enhance diversity. Please refer to our paper for… See the full description on the dataset page: https://huggingface.co/datasets/OpenGVLab/MMPR-Tiny.
Robots used
null
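
The selection step in the description (per-query accuracy computed from rollouts, then keeping queries between 0.2 and 0.8) can be illustrated with a minimal sketch. This is not the authors' code: the function names, the dict-of-booleans input layout, and the inclusive treatment of both bounds are assumptions made for illustration only.

```python
from typing import Dict, List


def query_accuracy(rollout_correct: List[bool]) -> float:
    """Fraction of rollouts for one query that were judged correct."""
    if not rollout_correct:
        raise ValueError("a query needs at least one rollout")
    return sum(rollout_correct) / len(rollout_correct)


def select_for_online_rl(
    rollouts: Dict[str, List[bool]],
    low: float = 0.2,   # lower bound from the description
    high: float = 0.8,  # upper bound from the description
) -> List[str]:
    """Return IDs of queries whose accuracy lies in [low, high].

    Assumption: both bounds are inclusive; the description only says
    "between 0.2 and 0.8". Queries with no rollouts are skipped.
    """
    selected = []
    for query_id, flags in rollouts.items():
        if not flags:
            continue
        if low <= query_accuracy(flags) <= high:
            selected.append(query_id)
    return selected


# Hypothetical rollout outcomes (True = correct answer).
example = {
    "q1": [True, True, True, True],       # accuracy 1.0, too easy
    "q2": [True, False, True, False],     # accuracy 0.5, kept
    "q3": [False, False, False, False],   # accuracy 0.0, too hard
    "q4": [True, False, False],           # accuracy ~0.33, kept
}
print(select_for_online_rl(example))
```

Filtering this way drops queries the model always or never solves, which would presumably carry little learning signal during RL; that rationale is an inference and is not stated in the description.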

Links

HuggingFace dataset