dataset
rlvr-guru-raw-data-extended
AmanPriyanshu
or hover any field below to flag it
Overview
Name
rlvr-guru-raw-data-extended
Source
AmanPriyanshu
Episodes
0
Robot count
0
Format
other
Description
RLVR GURU Extended: Compiling a 150K Cross-Domain Dataset for RLVR
A comprehensive cross-domain reasoning dataset containing 150,000 training samples and 221,332 test samples across diverse reasoning-intensive domains. This dataset extends the foundational work from the GURU dataset (Cheng et al., 2025) by incorporating additional STEM reasoning domains (MedMCQA and CommonsenseQA) while maintaining rigorous quality standards and verification mechanisms essential for reinforcement… See the full description on the dataset page: https://huggingface.co/datasets/AmanPriyanshu/rlvr-guru-raw-data-extended.
Robots used
null
Links
HuggingFace dataset