dataset

apps-control-arena

RoganInglis

or hover any field below to flag it

Overview

Name
apps-control-arena
Source
RoganInglis
Episodes
0
Robot count
0
Format
parquet
Description
APPS Control Arena Dataset Unified dataset combining APPS problems with backdoors from both the AI Control paper and Control-Tax paper. Dataset Description This dataset is based on the codeparrot/apps dataset, enhanced with backdoor solutions from two sources: APPS Backdoors: From "AI Control: Improving Safety Despite Intentional Subversion" / TylordTheGreat/apps-backdoors-04-02-25 Control-Tax Backdoors: From "Control Tax: The Price of Keeping AI in Check"… See the full description on the dataset page: https://huggingface.co/datasets/RoganInglis/apps-control-arena.
Robots used
null

Links