dataset
Nemotron-RL-Agentic-Function-Calling-Pivot-v1
nvidia
Overview
Name
Nemotron-RL-Agentic-Function-Calling-Pivot-v1
Source
nvidia
Episodes
0
Robot count
0
Format
json
Description
This is an RL dataset for general function calling, built from existing expert tool-use trajectories. Each assistant step of a trajectory is posed as a separate behavior cloning problem, in which the policy model is incentivized to match the tool-call choices of the expert model.
This dataset is released as part of NVIDIA NeMo Gym, a framework for building reinforcement learning environments to train large language models. NeMo Gym contains a growing collection of… See the full description on the dataset page: https://huggingface.co/datasets/nvidia/Nemotron-RL-Agentic-Function-Calling-Pivot-v1.
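The per-step behavior cloning setup described above can be illustrated with a minimal sketch. It assumes a chat-style trajectory in which each message is a dict with a "role" key and assistant messages may carry a "tool_calls" list of {"name", "arguments"} dicts. That schema, the helper names split_into_steps and match_reward, and the exact-match binary reward are all hypothetical choices made for illustration, not the dataset's actual format or NeMo Gym's reward implementation.

```python
import json
from typing import Any

# Hypothetical message schema: {"role": ..., "content": ..., "tool_calls": [...]}
Message = dict[str, Any]


def split_into_steps(trajectory: list[Message]) -> list[tuple[list[Message], list[dict]]]:
    """Turn one expert trajectory into (context, expert_tool_calls) pairs,
    one pair per assistant step."""
    samples = []
    for i, msg in enumerate(trajectory):
        if msg["role"] == "assistant":
            # The context is everything the expert saw before this step.
            samples.append((trajectory[:i], msg.get("tool_calls", [])))
    return samples


def _canonical(call: dict) -> tuple[str, str]:
    # Serialize arguments with sorted keys so key order does not matter.
    return call["name"], json.dumps(call.get("arguments", {}), sort_keys=True)


def match_reward(predicted: list[dict], expert: list[dict]) -> float:
    """Assumed reward: 1.0 if the policy's tool calls equal the expert's
    (ignoring call order and argument key order), else 0.0."""
    same = sorted(map(_canonical, predicted)) == sorted(map(_canonical, expert))
    return float(same)
```

Under these assumptions, a trajectory with two assistant turns yields two independent training samples, and the policy's output at each sample is scored only against the expert's tool calls for that same step.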
Robots used
null
Links
HuggingFace dataset