dataset
Alignment_and_Reasoning_RL
nirmalpratheep
or hover any field below to flag it
Overview
Name
Alignment_and_Reasoning_RL
Source
nirmalpratheep
Episodes
0
Robot count
0
Format
other
Description
Alignment and Reasoning RL Exercises on Math Dataset - Qwen2.5-Math-1.5B Model
Robots used
null
Links
HuggingFace dataset
null