dataset

TeaMs-RL

SafeRL-Lab

or hover any field below to flag it

Overview

Name

TeaMs-RL

Source

SafeRL-Lab

Episodes

0

Robot count

0

Format

other

Description

[TMLR] TeaMs-RL: Teaching LLMs to Generate Better Instruction Datasets via Reinforcement Learning.

Robots used

null

Links

HuggingFace dataset

null