Festivus
Home
Data
Contribute
Changelog
Search
← Back to Benchmarks
sim
medium
eval_dataset
metric · varies
Results Public
Description
HuggingFace evaluation dataset: gaia-benchmark/results_public
Source
https://huggingface.co/datasets/gaia-benchmark/results_public