dataset
OREAL-RL-Prompts
internlm
or hover any field below to flag it
Overview
Name
OREAL-RL-Prompts
Source
internlm
Episodes
0
Robot count
0
Format
parquet
Description
OREAL-RL-Prompts
Links
Arxiv
Github
OREAL-7B Model
OREAL-32B Model
Data
Introduction
This repository contains the prompts used in the RL training phase of the OREAL project. The prompts are collected from MATH, Numina, and historical AMC/AIME (2024 is excluded). The pass rate of the prompts are calculated with 16 times of inference with OREAL-7B-SFT.
Robots used
null
Links
HuggingFace dataset