dataset

OpenThoughts-Agent-v1-RL

open-thoughts

or hover any field below to flag it

Overview

Name
OpenThoughts-Agent-v1-RL
Source
open-thoughts
Episodes
0
Robot count
0
Format
parquet
Description
Project | SFT dataset | RL dataset | SFT model | RL model OpenThoughts-Agent-v1-RL A curated RL dataset of ~720 tasks with instructions, environments, and verifiers for agentic training.OpenThoughts-Agent is an open-source effort to curate the best datasets for training agents. Our first release includes datasets, models and our research codebase; OpenThinker-Agent-v1 is a model trained for agentic tasks such as Terminal-Bench 2.0 and SWE-Bench. We built… See the full description on the dataset page: https://huggingface.co/datasets/open-thoughts/OpenThoughts-Agent-v1-RL.
Robots used
null

Links