dataset
MLR_full_trajectory
sxiong
or hover any field below to flag it
Overview
Name
MLR_full_trajectory
Source
sxiong
Episodes
0
Robot count
0
Format
json
Description
DeepSeek-R1 Reasoning Trajectories
This dataset contains raw reasoning trajectories generated by DeepSeek-R1, used in (ICLR 2026) Enhancing Language Model Reasoning with Structured Multi-Level Modeling.
The trajectories capture the full reasoning process produced by the model, including hidden chain-of-thought reasoning and final responses.
They are provided to support research on reasoning analysis, trajectory supervision, and multi-step reasoning training.
Available… See the full description on the dataset page: https://huggingface.co/datasets/sxiong/MLR_full_trajectory.
Robots used
null
Links
HuggingFace dataset