dataset

Nemotron-RL-Agentic-Function-Calling-Pivot-v1

nvidia

or hover any field below to flag it

Overview

Name

Source

nvidia

Episodes

Robot count

Format

json

Description

Dataset Description: This is a RL dataset for general function-calling by utilizing existing expert tool-use trajectories. We pose each assistant step of the trajectory as a separate behavior cloning problem where the policy model is incentivized to match the tool call choices of the expert model. This dataset is released as part of NVIDIA NeMo Gym, a framework for building reinforcement learning environments to train large language models. NeMo Gym contains a growing collection of… See the full description on the dataset page: https://huggingface.co/datasets/nvidia/Nemotron-RL-Agentic-Function-Calling-Pivot-v1.

Robots used

null

Links

HuggingFace dataset

nvidia/Nemotron-RL-Agentic-Function-Calling-Pivot-v1