dataset

rlvr-guru-raw-data-extended

AmanPriyanshu

Overview

Name
rlvr-guru-raw-data-extended
Source
AmanPriyanshu
Episodes
0
Robot count
0
Format
other
Description
RLVR GURU Extended: Compiling a 150K Cross-Domain Dataset for RLVR. A comprehensive cross-domain reasoning dataset containing 150,000 training samples and 221,332 test samples across diverse reasoning-intensive domains. This dataset extends the foundational work from the GURU dataset (Cheng et al., 2025) by incorporating additional STEM reasoning domains (MedMCQA and CommonsenseQA) while maintaining rigorous quality standards and verification mechanisms essential for reinforcement… See the full description on the dataset page: https://huggingface.co/datasets/AmanPriyanshu/rlvr-guru-raw-data-extended.
Robots used
null
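
The fields above identify a dataset hosted on the Hugging Face Hub, so a minimal loading sketch may be useful. It assumes the third-party `datasets` library is installed and that the repository loads with its default configuration; neither is confirmed by this listing. The helper names `dataset_page_url` and `summarize_splits` are hypothetical, introduced only for illustration, and the split names are read from whatever the Hub returns rather than assumed.

```python
# Repository id taken from the Source and Name fields of this listing.
REPO_ID = "AmanPriyanshu/rlvr-guru-raw-data-extended"

# Sample counts as stated in the Description field.
TRAIN_SAMPLES = 150_000
TEST_SAMPLES = 221_332


def dataset_page_url(repo_id: str) -> str:
    """Build the Hugging Face dataset page URL for a repository id."""
    return f"https://huggingface.co/datasets/{repo_id}"


def summarize_splits(repo_id: str = REPO_ID) -> dict:
    """Return {split_name: row_count} for every split the Hub exposes.

    Assumption: the repository loads without an explicit configuration
    name. Needs network access and the `datasets` package.
    """
    from datasets import load_dataset  # imported lazily; third-party

    splits = load_dataset(repo_id)  # a DatasetDict when no split is given
    return {name: split.num_rows for name, split in splits.items()}
```

Calling `summarize_splits()` downloads the data, so it is best run once and compared against the counts in the Description (150,000 training and 221,332 test samples, 371,332 in total). If the repository turns out to define several configurations, `load_dataset` will ask for one by name.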

Links