dataset
guru-RL-92k-extra-info-compressed
LLM360
or hover any field below to flag it
Overview
Name
guru-RL-92k-extra-info-compressed
Source
LLM360
Episodes
0
Robot count
0
Format
parquet
Description
Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective
Note for this extra-info-compressed data version!
The dataset provided in this repository is specifically intended for use with the latest release of VeRL (v0.4.0). Since VeRL rl_dataset.py processes datasets as datasets.Dataset, it is essential that the structure of all Parquet files remains fully consistent. This repository is designed to meet that requirement.
In this repo, the… See the full description on the dataset page: https://huggingface.co/datasets/LLM360/guru-RL-92k-extra-info-compressed.
Robots used
null
Links
HuggingFace dataset