dataset

harbor-rl-data

ElvinDu518

or hover any field below to flag it

Overview

Name
harbor-rl-data
Source
ElvinDu518
Episodes
0
Robot count
0
Format
other
Description
harbor-rl-data Harbor task datasets for online RL training with rllm-harbor. Each task is a self-contained Harbor task directory containing: task.toml — task metadata (resources, timeouts) instruction.md — problem statement shown to the agent tests/test.sh — evaluation script (runs agent patch, grades with swebench) tests/config.json — install config (python version, packages, test command) Reward is 1 if all FAIL_TO_PASS tests pass and all PASS_TO_PASS tests remain passing, else… See the full description on the dataset page: https://huggingface.co/datasets/ElvinDu518/harbor-rl-data.
Robots used
null

Links

HuggingFace dataset