dataset

MLR_full_trajectory

sxiong

Overview

Name

MLR_full_trajectory

Source

sxiong

Episodes

Robot count

Format

json

Description

DeepSeek-R1 Reasoning Trajectories. This dataset contains raw reasoning trajectories generated by DeepSeek-R1 and used in the ICLR 2026 paper "Enhancing Language Model Reasoning with Structured Multi-Level Modeling". The trajectories capture the model's full reasoning process, including hidden chain-of-thought reasoning and final responses. They are provided to support research on reasoning analysis, trajectory supervision, and multi-step reasoning training. Available… See the full description on the dataset page: https://huggingface.co/datasets/sxiong/MLR_full_trajectory.

Robots used

null

Links

HuggingFace dataset

sxiong/MLR_full_trajectory
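
The record above gives only the repository id and the format (json), not the file layout or field names, so a cautious first step is to list the repository's files rather than assume a schema. The following is a minimal sketch under that assumption: the helper names (`dataset_url`, `filter_json`, `list_json_files`) are hypothetical, and the only third-party call used is `list_repo_files` from the `huggingface_hub` package.

```python
from typing import Iterable, List

# Repository id taken from the Links field of this record.
REPO_ID = "sxiong/MLR_full_trajectory"


def dataset_url(repo_id: str = REPO_ID) -> str:
    """Build the dataset page URL for a Hugging Face dataset repository."""
    return f"https://huggingface.co/datasets/{repo_id}"


def filter_json(paths: Iterable[str]) -> List[str]:
    """Keep only paths ending in .json or .jsonl (case-insensitive), sorted."""
    return sorted(p for p in paths if p.lower().endswith((".json", ".jsonl")))


def list_json_files(repo_id: str = REPO_ID) -> List[str]:
    """List JSON files in the repository.

    Requires network access and the huggingface_hub package, which is
    imported lazily so the pure helpers above work without it.
    """
    from huggingface_hub import list_repo_files

    return filter_json(list_repo_files(repo_id, repo_type="dataset"))
```

Calling `list_json_files()` returns whatever JSON files the repository actually contains; inspecting one of them is the safest way to learn the record structure, since this page does not document it.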