dataset

orz_math_57k_collection

Open-Reasoner-Zero

Overview

Name

Source

Open-Reasoner-Zero

Episodes

Robot count

Format

other

Description

Open Reasoner Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model. We introduce Open-Reasoner-Zero, the first open source implementation of large-scale reasoning-oriented RL training focusing on scalability, simplicity and accessibility. To enable broader participation in this pivotal moment we witnessed and accelerate research towards artificial general intelligence (AGI)… See the full description on the dataset page: https://huggingface.co/datasets/Open-Reasoner-Zero/orz_math_57k_collection.

Robots used

null

Links

HuggingFace dataset

Open-Reasoner-Zero/orz_math_57k_collection