dataset

MLR_full_trajectory

sxiong

or hover any field below to flag it

Overview

Name
MLR_full_trajectory
Source
sxiong
Episodes
0
Robot count
0
Format
json
Description
DeepSeek-R1 Reasoning Trajectories This dataset contains raw reasoning trajectories generated by DeepSeek-R1, used in (ICLR 2026) Enhancing Language Model Reasoning with Structured Multi-Level Modeling. The trajectories capture the full reasoning process produced by the model, including hidden chain-of-thought reasoning and final responses. They are provided to support research on reasoning analysis, trajectory supervision, and multi-step reasoning training. Available… See the full description on the dataset page: https://huggingface.co/datasets/sxiong/MLR_full_trajectory.
Robots used
null

Links

HuggingFace dataset