dataset

OREAL-RL-Prompts

internlm

or hover any field below to flag it

Overview

Name
OREAL-RL-Prompts
Source
internlm
Episodes
0
Robot count
0
Format
parquet
Description
OREAL-RL-Prompts Links Arxiv Github OREAL-7B Model OREAL-32B Model Data Introduction This repository contains the prompts used in the RL training phase of the OREAL project. The prompts are collected from MATH, Numina, and historical AMC/AIME (2024 is excluded). The pass rate of the prompts are calculated with 16 times of inference with OREAL-7B-SFT.
Robots used
null

Links

HuggingFace dataset