dataset
Dolci-Think-RL-7B
allenai
or hover any field below to flag it
Overview
Name
Dolci-Think-RL-7B
Source
allenai
Episodes
0
Robot count
0
Format
parquet
Description
Dolci-Think-RL-7B
Dataset Summary
Dolci-Think-RL-7B is the reinforcement learning dataset used to train the Olmo-3-7B-Think model.It contains 102,014 prompts designed to elicit deep reasoning across:
Math
Coding
Precise Instruction Following
General Chat
It blends high-quality curated sources with filtering designed for deliberate reasoning.
Dataset Composition
Total Samples: 102,014
Original Dataset Contribution… See the full description on the dataset page: https://huggingface.co/datasets/allenai/Dolci-Think-RL-7B.
Robots used
null
Links
HuggingFace dataset