dataset

orz_math_57k_collection

Open-Reasoner-Zero

or hover any field below to flag it

Overview

Name
orz_math_57k_collection
Source
Open-Reasoner-Zero
Episodes
0
Robot count
0
Format
other
Description
Open Reasoner Zero An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model Paper Arxiv Link πŸ‘οΈ Overview 🌊 We introduce Open-Reasoner-Zero, the first open source implementation of large-scale reasoning-oriented RL training focusing on scalability, simplicity and accessibility. To enable broader participation in this pivotal moment we witnessed and accelerate research towards artificial general intelligence (AGI)… See the full description on the dataset page: https://huggingface.co/datasets/Open-Reasoner-Zero/orz_math_57k_collection.
Robots used
null

Links