dataset

Reverse-Text-RL

PrimeIntellect

or hover any field below to flag it

Overview

Name
Reverse-Text-RL
Source
PrimeIntellect
Episodes
0
Robot count
0
Format
parquet
Description
Reverse-Text-RL A small, scrappy RL dataset used in prime-rl's CI to debug RL training asking a model to reverse small sentences character-by-character. Follows the general format of PrimeIntellect/Reverse-Text-SFT The following script was used to generate the dataset. from datasets import Dataset, load_dataset dataset = load_dataset("willcb/R1-reverse-wikipedia-paragraphs-v1-1000", split="train") prompt = "Reverse the text character-by-character. Put your answer in <reversed_text>… See the full description on the dataset page: https://huggingface.co/datasets/PrimeIntellect/Reverse-Text-RL.
Robots used
null

Links