sim · medium · eval_dataset · metric · varies

MultiBanana Benchmark

Description

Evaluation dataset hosted on the Hugging Face Hub: kohsei/MultiBanana-Benchmark

Source

https://huggingface.co/datasets/kohsei/MultiBanana-Benchmark
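
Example

A minimal sketch of how the dataset might be fetched, assuming the Hugging Face `datasets` library is installed and the repository loads with its default configuration (this page does not document configurations or split names, so none are assumed here). The helper names `dataset_url` and `load_benchmark` are hypothetical and only serve the illustration.

```python
HF_DATASETS_BASE = "https://huggingface.co/datasets"
REPO_ID = "kohsei/MultiBanana-Benchmark"  # repo id taken from this page


def dataset_url(repo_id: str) -> str:
    """Build the Hub page URL for a dataset repo id."""
    return f"{HF_DATASETS_BASE}/{repo_id}"


def load_benchmark(repo_id: str = REPO_ID):
    """Download the dataset from the Hub.

    Assumption: the repo loads without an explicit config name.
    Requires `pip install datasets` and network access.
    """
    # Imported lazily so dataset_url() works without the library installed.
    from datasets import load_dataset

    return load_dataset(repo_id)
```

Calling `load_benchmark()` and printing the result should list whatever splits and columns the repository actually exposes, which is a reasonable first check before wiring the data into an evaluation run.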