dataset
TeaMs-RL
SafeRL-Lab
or hover any field below to flag it
Overview
Name
TeaMs-RL
Source
SafeRL-Lab
Episodes
0
Robot count
0
Format
other
Description
[TMLR] TeaMs-RL: Teaching LLMs to Generate Better Instruction Datasets via Reinforcement Learning.
Robots used
null
Links
HuggingFace dataset
null