dataset

Alignment_and_Reasoning_RL

nirmalpratheep

or hover any field below to flag it

Overview

Name

Alignment_and_Reasoning_RL

Source

nirmalpratheep

Episodes

0

Robot count

0

Format

other

Description

Alignment and Reasoning RL Exercises on Math Dataset - Qwen2.5-Math-1.5B Model

Robots used

null

Links

HuggingFace dataset

null