dataset

OREAL-RL-Prompts

internlm

or hover any field below to flag it

Overview

Name

OREAL-RL-Prompts

Source

internlm

Episodes

Robot count

Format

parquet

Description

OREAL-RL-Prompts Links Arxiv Github OREAL-7B Model OREAL-32B Model Data Introduction This repository contains the prompts used in the RL training phase of the OREAL project. The prompts are collected from MATH, Numina, and historical AMC/AIME (2024 is excluded). The pass rate of the prompts are calculated with 16 times of inference with OREAL-7B-SFT.

Robots used

null

Links

HuggingFace dataset

internlm/OREAL-RL-Prompts