dataset

Dolci-Think-RL-7B

allenai


Overview

Name

Dolci-Think-RL-7B

Source

allenai

Episodes

Robot count

Format

parquet

Description

Dataset Summary

Dolci-Think-RL-7B is the reinforcement learning dataset used to train the Olmo-3-7B-Think model. It contains 102,014 prompts designed to elicit deep reasoning across four areas: math, coding, precise instruction following, and general chat. It blends high-quality curated sources with filtering designed for deliberate reasoning.

Dataset Composition

Total Samples: 102,014

Original Dataset Contribution… See the full description on the dataset page: https://huggingface.co/datasets/allenai/Dolci-Think-RL-7B.

Robots used

null

Links

HuggingFace dataset

allenai/Dolci-Think-RL-7B
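
Loading sketch

Since the card lists the format as parquet and links the HuggingFace repository, the dataset can probably be loaded with the Hugging Face `datasets` library. The following is a minimal sketch, not a documented recipe: the helper names `dataset_page_url` and `load_prompts` are hypothetical, and the split name "train" is an assumption that should be checked against the dataset page.

```python
from typing import Any

# Repository ID as listed under Links on this card.
REPO_ID = "allenai/Dolci-Think-RL-7B"


def dataset_page_url(repo_id: str = REPO_ID) -> str:
    """Build the dataset page URL for a HuggingFace repository ID."""
    return f"https://huggingface.co/datasets/{repo_id}"


def load_prompts(repo_id: str = REPO_ID, split: str = "train") -> Any:
    """Load the prompts with the `datasets` library.

    The split name "train" is an assumption, not something this card states.
    The import is deferred so the URL helper works without `datasets` installed.
    """
    from datasets import load_dataset  # requires: pip install datasets

    return load_dataset(repo_id, split=split)


if __name__ == "__main__":
    print(dataset_page_url())
    prompts = load_prompts()  # downloads the parquet files on first use
    print(prompts)            # shows the column names and row count
```

Printing the loaded object is a quick way to compare the row count against the 102,014 prompts stated in the description.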