dataset

Nemotron-Cascade-2-RL-data

nvidia

Overview

Name
Nemotron-Cascade-2-RL-data
Source
nvidia
Episodes
0
Robot count
0
Format
json
Description
The Nemotron-Cascade-2-RL dataset is a curated reinforcement learning (RL) data blend used to train the Nemotron-Cascade-2-30B-A3B model. It includes instruction-following RL, multi-domain RL, on-policy distillation, and software engineering RL (SWE-RL) data. The dataset is ready for commercial use. It contains the following subsets. IF-RL: contains 45,879 training samples for instruction-following RL. Our curation process mainly resolves… See the full description on the dataset page: https://huggingface.co/datasets/nvidia/Nemotron-Cascade-2-RL-data.
Robots used
null

Links