← Back to Benchmarks
simmediumeval_datasetmetric · varies

Ii Agent Gaia Benchmark Validation

Description

HuggingFace evaluation dataset: Intelligent-Internet/ii-agent_gaia-benchmark_validation

Source

https://huggingface.co/datasets/Intelligent-Internet/ii-agent_gaia-benchmark_validation