dataset
code-verl-unified
sungyub
or hover any field below to flag it
Overview
Name
code-verl-unified
Source
sungyub
Episodes
0
Robot count
0
Format
parquet
Description
Unified Code VERL Dataset
Overview
This dataset aggregates seven code-reasoning collections into a single VERL-formatted repository containing approximately 958,539 unique problems. The compilation prioritizes consistent extra_info structure across all source materials for seamless compatibility with VERL training frameworks.
Dataset Composition
Seven distinct splits comprise the collection:
Split
Problems
Percentage
Format
kodcode_v1_verl
434,876… See the full description on the dataset page: https://huggingface.co/datasets/sungyub/code-verl-unified.
Robots used
null
Links
HuggingFace dataset