dataset
RLVR-GSM-MATH-IF-Mixed-Constraints
allenai
Overview
Name
RLVR-GSM-MATH-IF-Mixed-Constraints
Source
allenai
Episodes
0
Robot count
0
Format
parquet
Description
GSM/MATH/IF Data - RLVR Formatted
Note that this collection is licensed under ODC-BY-1.0; different licenses apply to subsets of the data.
This dataset contains data formatted for use with open-instruct, specifically for reinforcement learning with verifiable rewards (RLVR).
It was used to train the final Tulu 3 models with RL, and contains the following subsets:
GSM8k (7,473 samples): The GSM8k train set formatted for use with RLVR and open-instruct. MIT License.
MATH (7,500…
See the full description on the dataset page: https://huggingface.co/datasets/allenai/RLVR-GSM-MATH-IF-Mixed-Constraints.
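As a rough illustration of what a "verifiable reward" can look like for a GSM8k-style sample, the sketch below scores a model response by comparing the last number it contains against a reference answer. This is not open-instruct's actual reward code: the function names `extract_last_number` and `verifiable_reward` and the last-number heuristic are assumptions made for this example only.

```python
import re
from typing import Optional

# Hypothetical helpers for illustration; not taken from open-instruct.
# Matches integers and decimals, optionally negative, with optional
# thousands separators (e.g. "18", "-3", "1,234", "2.50").
_NUMBER = re.compile(r"-?\d[\d,]*(?:\.\d+)?")


def extract_last_number(text: str) -> Optional[str]:
    """Return the last number found in text with commas removed, or None."""
    matches = _NUMBER.findall(text)
    if not matches:
        return None
    return matches[-1].replace(",", "")


def verifiable_reward(response: str, ground_truth: str) -> float:
    """Return 1.0 if the response's final number equals the reference, else 0.0."""
    predicted = extract_last_number(response)
    target = extract_last_number(ground_truth)
    if predicted is None or target is None:
        return 0.0
    return 1.0 if float(predicted) == float(target) else 0.0


if __name__ == "__main__":
    response = "She has 9 eggs left and earns 9 * 2 = 18 dollars. The answer is 18."
    print(verifiable_reward(response, "18"))
```

The reward is binary and computed by a deterministic check rather than a learned reward model, which is the property that makes it "verifiable". A real setup would likely need stricter answer extraction than this heuristic.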
Robots used
null
Links
HuggingFace dataset