dataset

Dolci-Think-RL-7B

allenai

or hover any field below to flag it

Overview

Name
Dolci-Think-RL-7B
Source
allenai
Episodes
0
Robot count
0
Format
parquet
Description
Dolci-Think-RL-7B Dataset Summary Dolci-Think-RL-7B is the reinforcement learning dataset used to train the Olmo-3-7B-Think model.It contains 102,014 prompts designed to elicit deep reasoning across: Math Coding Precise Instruction Following General Chat It blends high-quality curated sources with filtering designed for deliberate reasoning. Dataset Composition Total Samples: 102,014 Original Dataset Contribution… See the full description on the dataset page: https://huggingface.co/datasets/allenai/Dolci-Think-RL-7B.
Robots used
null

Links

HuggingFace dataset