dataset

TeaMs-RL

SafeRL-Lab

Overview

Name: TeaMs-RL
Source: SafeRL-Lab
Episodes: 0
Robot count: 0
Format: other
Description: [TMLR] TeaMs-RL: Teaching LLMs to Generate Better Instruction Datasets via Reinforcement Learning.
Robots used: null

Links

HuggingFace dataset: null