dataset
Nemotron-Cascade-2-RL-data
nvidia
or hover any field below to flag it
Overview
Name
Nemotron-Cascade-2-RL-data
Source
nvidia
Episodes
0
Robot count
0
Format
json
Description
Dataset Description:
The Nemotron-Cascade-2-RL dataset is a curated reinforcement learning (RL) dataset blend used to train Nemotron-Cascade-2-30B-A3B model. It includes instruction-following RL, multi-domain RL, on-policy distillation, and software engineering RL (SWE-RL) data.
This dataset is ready for commercial use.
The dataset contains the following subset:
IF-RL
Contains 45,879 training samples for instruction-following RL. Our curation process mainly resolves… See the full description on the dataset page: https://huggingface.co/datasets/nvidia/Nemotron-Cascade-2-RL-data.
Robots used
null
Links
HuggingFace dataset