dataset

Fast-Math-R1-SFT

RabotniKuma

or hover any field below to flag it

Overview

Name
Fast-Math-R1-SFT
Source
RabotniKuma
Episodes
0
Robot count
0
Format
csv
Description
This repository contains the First stage SFT dataset as presented in the paper A Practical Two-Stage Recipe for Mathematical LLMs: Maximizing Accuracy with SFT and Efficiency with Reinforcement Learning. This dataset is used for the intensive Supervised Fine-Tuning (SFT) phase, crucial for pushing the model's mathematical accuracy. Project GitHub Repository: https://github.com/RabotniKuma/Kaggle-AIMO-Progress-Prize-2-9th-Place-Solution Dataset Construction This dataset was… See the full description on the dataset page: https://huggingface.co/datasets/RabotniKuma/Fast-Math-R1-SFT.
Robots used
null

Links

HuggingFace dataset