dataset

guru-RL-92k-extra-info-compressed

LLM360

or hover any field below to flag it

Overview

Name
guru-RL-92k-extra-info-compressed
Source
LLM360
Episodes
0
Robot count
0
Format
parquet
Description
Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective Note for this extra-info-compressed data version! The dataset provided in this repository is specifically intended for use with the latest release of VeRL (v0.4.0). Since VeRL rl_dataset.py processes datasets as datasets.Dataset, it is essential that the structure of all Parquet files remains fully consistent. This repository is designed to meet that requirement. In this repo, the… See the full description on the dataset page: https://huggingface.co/datasets/LLM360/guru-RL-92k-extra-info-compressed.
Robots used
null

Links