dataset
apps-control-arena
RoganInglis
or hover any field below to flag it
Overview
Name
apps-control-arena
Source
RoganInglis
Episodes
0
Robot count
0
Format
parquet
Description
APPS Control Arena Dataset
Unified dataset combining APPS problems with backdoors from both the AI Control paper and Control-Tax paper.
Dataset Description
This dataset is based on the codeparrot/apps dataset,
enhanced with backdoor solutions from two sources:
APPS Backdoors: From "AI Control: Improving Safety Despite Intentional Subversion" / TylordTheGreat/apps-backdoors-04-02-25
Control-Tax Backdoors: From "Control Tax: The Price of Keeping AI in Check"… See the full description on the dataset page: https://huggingface.co/datasets/RoganInglis/apps-control-arena.
Robots used
null
Links
HuggingFace dataset