dataset
dr-tulu-rl-data
rl-research
or hover any field below to flag it
Overview
Name
dr-tulu-rl-data
Source
rl-research
Episodes
0
Robot count
0
Format
parquet
Description
[!NOTE]
For full information, go check out the Dr Tulu paper here.
DR Tulu RL Data
This dataset contains the RL training data for DR Tulu, containing prompts and search-based rubrics generated from OpenScholar and SearchArena prompts, with rubrics generated using GPT-4.1.
Important: This does not contain the RaR datasets we use in final RL training, but only the OpenScholar and SearchArena subsets. For the RaR data, we use data from:
anisha2102/RaR-Science-20k-o3-mini… See the full description on the dataset page: https://huggingface.co/datasets/rl-research/dr-tulu-rl-data.
Robots used
null
Links
HuggingFace dataset