dataset

sft_alfworld_trajectory_dataset_v3to5_admissible_all

kuririrn

or hover any field below to flag it

Overview

Name
sft_alfworld_trajectory_dataset_v3to5_admissible_all
Source
kuririrn
Episodes
0
Robot count
0
Format
parquet
Description
Dataset Card for ALFWorld SFT Trajectories with Admissible Actions (v3–v5) Overview This dataset contains a subset of ALFWorld-style trajectories used for supervised fine-tuning (SFT) of an agent interacting with a textual household environment. It is constructed by merging v3, v4, and v5 of the original u-10bei/sft_alfworld_trajectory_dataset_* series and filtering only trajectories that include admissible actions in the observation text. Each example corresponds to one… See the full description on the dataset page: https://huggingface.co/datasets/kuririrn/sft_alfworld_trajectory_dataset_v3to5_admissible_all.
Robots used
null

Links