dataset
Nemotron-3-Nano-RL-Training-Blend
nvidia
or hover any field below to flag it
Overview
Name
Nemotron-3-Nano-RL-Training-Blend
Source
nvidia
Episodes
0
Robot count
0
Format
other
Description
Dataset Description:
Nemotron-3-Nano-RL-Training-Blend is a curated dataset blend used to train the Nemotron-3-Nano-30B-A3B model. The blend consists of the following component datasets, with mixing ratios shown in parentheses:
nvidia/Nemotron-RL-instruction_following (0.17)
nvidia/Nemotron-RL-knowledge-mcqa (0.20)
nvidia/Nemotron-RL-agent-workplace_assistant (0.10)
nvidia/Nemotron-RL-instruction_following-structured_outputs (0.05)
nvidia/Nemotron-RL-coding-competitive_coding… See the full description on the dataset page: https://huggingface.co/datasets/nvidia/Nemotron-3-Nano-RL-Training-Blend.
Robots used
null
Links
HuggingFace dataset