sim · medium · eval_dataset · metric: varies

CritPt

Description

Hugging Face evaluation dataset: CritPt-Benchmark/CritPt

Source

https://huggingface.co/datasets/CritPt-Benchmark/CritPt