← Back to Benchmarks
simmediumeval_datasetmetric · varies
Natural Language Prompt W Correct Ans Dataset Evaluation Instruct Dataset
Description
HuggingFace evaluation dataset: y1xing/natural_language_prompt_w_correct_ans_dataset_evaluation_instruct_dataset