dataset
BRIGHT
mteb
or hover any field below to flag it
Overview
Name
BRIGHT
Source
mteb
Episodes
0
Robot count
0
Format
parquet
Description
BRIGHT benchmark
BRIGHT is the first text retrieval benchmark that requires intensive reasoning to retrieve relevant documents.
The queries are collected from diverse domains (StackExchange, LeetCode, and math competitions), all sourced from realistic human data.
Experiments show that existing retrieval models perform poorly on BRIGHT, where the highest score is only 22.1 measured by nDCG@10.
BRIGHT provides a good testbed for future retrieval research in more realistic and… See the full description on the dataset page: https://huggingface.co/datasets/mteb/BRIGHT.
Robots used
null
Links
HuggingFace dataset