dataset

Fast-Math-R1-SFT

RabotniKuma

or hover any field below to flag it

Overview

Name

Fast-Math-R1-SFT

Source

RabotniKuma

Episodes

Robot count

Format

csv

Description

This repository contains the First stage SFT dataset as presented in the paper A Practical Two-Stage Recipe for Mathematical LLMs: Maximizing Accuracy with SFT and Efficiency with Reinforcement Learning. This dataset is used for the intensive Supervised Fine-Tuning (SFT) phase, crucial for pushing the model's mathematical accuracy. Project GitHub Repository: https://github.com/RabotniKuma/Kaggle-AIMO-Progress-Prize-2-9th-Place-Solution Dataset Construction This dataset was… See the full description on the dataset page: https://huggingface.co/datasets/RabotniKuma/Fast-Math-R1-SFT.

Robots used

null

Links

HuggingFace dataset

RabotniKuma/Fast-Math-R1-SFT