dataset

RLVR-IFeval

allenai


Overview

Name

RLVR-IFeval

Source

allenai

Episodes

Robot count

Format

parquet

Description

IF Data - RLVR Formatted. This dataset contains instruction-following data formatted for use with open-instruct, specifically for reinforcement learning with verifiable rewards (RLVR). The prompts carry verifiable constraints and were generated by sampling from the Tulu 2 SFT mixture and randomly adding constraints from IFEval. The dataset is part of the Tulu 3 release. Dataset structure: each example in the dataset contains the standard instruction-tuning… See the full description on the dataset page: https://huggingface.co/datasets/allenai/RLVR-IFeval.

Robots used

null

Links

HuggingFace dataset

allenai/RLVR-IFeval
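
The description refers to "verifiable constraints" and "verifiable rewards". As a rough illustration of the idea only, the sketch below scores a response 1.0 when a programmatic check passes and 0.0 otherwise. The constraint types (`min_words`, `keyword_frequency`), the dictionary layout, and all function names are assumptions made up for this example; they are not the dataset's actual schema and not the open-instruct or IFEval implementation.

```python
import re

# Hypothetical checkers: each returns True when the response satisfies
# the constraint. The names and arguments are illustrative assumptions.

def check_min_words(response: str, min_words: int) -> bool:
    """True if the response has at least `min_words` whitespace-separated words."""
    return len(response.split()) >= min_words


def check_keyword_frequency(response: str, keyword: str, min_count: int) -> bool:
    """True if `keyword` occurs as a whole word at least `min_count` times (case-insensitive)."""
    pattern = re.compile(r"\b" + re.escape(keyword) + r"\b", re.IGNORECASE)
    return len(pattern.findall(response)) >= min_count


# Hypothetical registry mapping a constraint type to its checker.
CHECKERS = {
    "min_words": check_min_words,
    "keyword_frequency": check_keyword_frequency,
}


def verifiable_reward(response: str, constraint: dict) -> float:
    """Binary reward: 1.0 if the constraint is met, 0.0 otherwise."""
    checker = CHECKERS[constraint["type"]]
    return 1.0 if checker(response, **constraint["args"]) else 0.0


if __name__ == "__main__":
    constraint = {"type": "keyword_frequency", "args": {"keyword": "cats", "min_count": 2}}
    print(verifiable_reward("Cats like other cats.", constraint))
```

Because the reward comes from a deterministic check rather than a learned judge, the same response and constraint always produce the same score, which is the property the word "verifiable" points at.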