dataset
KodCode-Light-RL-10K
KodCode
Overview
Name
KodCode-Light-RL-10K
Source
KodCode
Episodes
0
Robot count
0
Format
parquet
Description
🐱 KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding
KodCode is the largest fully synthetic open-source dataset providing verifiable solutions and tests for coding tasks. It contains 12 distinct subsets spanning various domains (from algorithmic to package-specific knowledge) and difficulty levels (from basic coding exercises to interview and competitive programming challenges). KodCode is designed for both supervised fine-tuning (SFT) and reinforcement learning (RL) tuning.
🕸️… See the full description on the dataset page: https://huggingface.co/datasets/KodCode/KodCode-Light-RL-10K.
Robots used
null
Links
HuggingFace dataset
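Since the card lists the format as parquet and links to the Hugging Face dataset page, the following is a minimal sketch of how the data might be pulled with the `datasets` library. The helper names `repo_id_from_url` and `load_kodcode` are hypothetical, and the `train` split name is an assumption not stated on this card; check the dataset page for the actual splits and columns.

```python
from urllib.parse import urlparse

# URL taken from the description above.
DATASET_URL = "https://huggingface.co/datasets/KodCode/KodCode-Light-RL-10K"


def repo_id_from_url(url: str) -> str:
    """Extract the '<namespace>/<name>' repo id from a Hub dataset URL."""
    parts = [p for p in urlparse(url).path.split("/") if p]
    if len(parts) != 3 or parts[0] != "datasets":
        raise ValueError(f"not a Hub dataset URL: {url}")
    return "/".join(parts[1:])


def load_kodcode(split: str = "train"):
    """Download one split (needs network access and the `datasets` package).

    The default split name "train" is an assumption; it is not listed on this card.
    """
    # Imported lazily so repo_id_from_url stays usable without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(repo_id_from_url(DATASET_URL), split=split)
```

Calling `load_kodcode()` and inspecting `column_names` on the result is a reasonable first step before relying on any particular field, since the column layout is not described here.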