dataset
RLVR-GSM
allenai
or hover any field below to flag it
Overview
Name
RLVR-GSM
Source
allenai
Episodes
0
Robot count
0
Format
parquet
Description
GSM8k Data - RLVR Formatted
This dataset contains the GSM8k dataset formatted for use with open-instruct - specifically reinforcement learning with verifiable rewards.
Part of the Tulu 3 release, for which you can see models here and datasets here.
Dataset Structure
Each example in the dataset contains the standard instruction-tuning data points as follow:
messages (list): inputs used to prompt the model (after chat template formatting).
ground_truth (str): the… See the full description on the dataset page: https://huggingface.co/datasets/allenai/RLVR-GSM.
Robots used
null
Links
HuggingFace dataset