dataset

Nemotron-3-Nano-RL-Training-Blend

nvidia

or hover any field below to flag it

Overview

Name
Nemotron-3-Nano-RL-Training-Blend
Source
nvidia
Episodes
0
Robot count
0
Format
other
Description
Dataset Description: Nemotron-3-Nano-RL-Training-Blend is a curated dataset blend used to train the Nemotron-3-Nano-30B-A3B model. The blend consists of the following component datasets, with mixing ratios shown in parentheses: nvidia/Nemotron-RL-instruction_following (0.17) nvidia/Nemotron-RL-knowledge-mcqa (0.20) nvidia/Nemotron-RL-agent-workplace_assistant (0.10) nvidia/Nemotron-RL-instruction_following-structured_outputs (0.05) nvidia/Nemotron-RL-coding-competitive_coding… See the full description on the dataset page: https://huggingface.co/datasets/nvidia/Nemotron-3-Nano-RL-Training-Blend.
Robots used
null

Links