dataset

dr-tulu-rl-data

rl-research

Overview

Name
dr-tulu-rl-data
Source
rl-research
Episodes
0
Robot count
0
Format
parquet
Description
Note: For full information, see the DR Tulu paper. DR Tulu RL Data: This dataset contains the RL training data for DR Tulu: prompts drawn from OpenScholar and SearchArena, each paired with search-based rubrics generated using GPT-4.1. Important: It includes only the OpenScholar and SearchArena subsets, not the RaR datasets used in final RL training. For the RaR data, we use data from: anisha2102/RaR-Science-20k-o3-mini… See the full description on the dataset page: https://huggingface.co/datasets/rl-research/dr-tulu-rl-data.
Robots used
null

Links

HuggingFace dataset