dataset

OpenThoughts-Agent-v1-RL

open-thoughts

or hover any field below to flag it

Overview

Name

Source

open-thoughts

Episodes

Robot count

Format

parquet

Description

Project | SFT dataset | RL dataset | SFT model | RL model OpenThoughts-Agent-v1-RL A curated RL dataset of ~720 tasks with instructions, environments, and verifiers for agentic training.OpenThoughts-Agent is an open-source effort to curate the best datasets for training agents. Our first release includes datasets, models and our research codebase; OpenThinker-Agent-v1 is a model trained for agentic tasks such as Terminal-Bench 2.0 and SWE-Bench. We built… See the full description on the dataset page: https://huggingface.co/datasets/open-thoughts/OpenThoughts-Agent-v1-RL.

Robots used

null

Links

HuggingFace dataset

open-thoughts/OpenThoughts-Agent-v1-RL