dataset

drkernel-coldstart-8k

hkust-nlp

or hover any field below to flag it

Overview

Name
drkernel-coldstart-8k
Source
hkust-nlp
Episodes
0
Robot count
0
Format
parquet
Description
DR.Kernel Cold-Start Dataset Paper | Code This directory documents the format of hkust-nlp/drkernel-coldstart-8k. The cold-start set is used for supervised fine-tuning (SFT) before RL in DR.Kernel. As described in the paper, it is built from 5-turn multi-turn trajectories collected with KernelGYM feedback. Overview Purpose: initialize kernel-generation ability (Triton coding + iterative optimization) before TRLOO/MRS/PR/PRS RL.Data form: one row per full multi-turn… See the full description on the dataset page: https://huggingface.co/datasets/hkust-nlp/drkernel-coldstart-8k.
Robots used
null

Links