dataset

RLVR-IFeval

allenai

or hover any field below to flag it

Overview

Name
RLVR-IFeval
Source
allenai
Episodes
0
Robot count
0
Format
parquet
Description
IF Data - RLVR Formatted This dataset contains instruction following data formatted for use with open-instruct - specifically reinforcement learning with verifiable rewards. Prompts with verifiable constraints generated by sampling from the Tulu 2 SFT mixture and randomly adding constraints from IFEval. Part of the Tulu 3 release, for which you can see models here and datasets here. Dataset Structure Each example in the dataset contains the standard instruction-tuning… See the full description on the dataset page: https://huggingface.co/datasets/allenai/RLVR-IFeval.
Robots used
null

Links

HuggingFace dataset