dataset
orz_math_57k_collection
Open-Reasoner-Zero
Overview
Name
orz_math_57k_collection
Source
Open-Reasoner-Zero
Episodes
0
Robot count
0
Format
other
Description
Open Reasoner Zero
An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model
Paper Arxiv Link
Overview
We introduce Open-Reasoner-Zero, the first open-source implementation of large-scale reasoning-oriented RL training focusing on scalability, simplicity, and accessibility.
To enable broader participation in this pivotal moment we witnessed and accelerate research towards artificial general intelligence (AGI)… See the full description on the dataset page: https://huggingface.co/datasets/Open-Reasoner-Zero/orz_math_57k_collection.
Robots used
null
Links
HuggingFace dataset