dataset

harbor-rl-data

ElvinDu518

or hover any field below to flag it

Overview

Name

harbor-rl-data

Source

ElvinDu518

Episodes

Robot count

Format

other

Description

harbor-rl-data Harbor task datasets for online RL training with rllm-harbor. Each task is a self-contained Harbor task directory containing: task.toml — task metadata (resources, timeouts) instruction.md — problem statement shown to the agent tests/test.sh — evaluation script (runs agent patch, grades with swebench) tests/config.json — install config (python version, packages, test command) Reward is 1 if all FAIL_TO_PASS tests pass and all PASS_TO_PASS tests remain passing, else… See the full description on the dataset page: https://huggingface.co/datasets/ElvinDu518/harbor-rl-data.

Robots used

null

Links

HuggingFace dataset

ElvinDu518/harbor-rl-data