← Back to Benchmarks
simmediumeval_datasetmetric · varies

Gaia Subset Benchmark

Description

HuggingFace evaluation dataset: Intelligent-Internet/GAIA-Subset-Benchmark

Source

https://huggingface.co/datasets/Intelligent-Internet/GAIA-Subset-Benchmark