dataset

BRIGHT

mteb

or hover any field below to flag it

Overview

Name

BRIGHT

Source

mteb

Episodes

Robot count

Format

parquet

Description

BRIGHT benchmark BRIGHT is the first text retrieval benchmark that requires intensive reasoning to retrieve relevant documents. The queries are collected from diverse domains (StackExchange, LeetCode, and math competitions), all sourced from realistic human data. Experiments show that existing retrieval models perform poorly on BRIGHT, where the highest score is only 22.1 measured by nDCG@10. BRIGHT provides a good testbed for future retrieval research in more realistic and… See the full description on the dataset page: https://huggingface.co/datasets/mteb/BRIGHT.

Robots used

null

Links

HuggingFace dataset

mteb/BRIGHT