Festivus
sim
medium
eval_dataset
metric · varies
Deepscalar RL Test Benchmark
Description
Hugging Face evaluation dataset: CohenQu/deepscalar_RL_test_benchmark
Source
https://huggingface.co/datasets/CohenQu/deepscalar_RL_test_benchmark
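The source URL above can be turned into a dataset repository ID and loaded with the Hugging Face `datasets` library. The following is a minimal sketch, not a documented recipe for this benchmark: the helper names `repo_id_from_url` and `load_benchmark` are hypothetical, and nothing is assumed about the dataset's splits or columns, which this page does not list.

```python
from urllib.parse import urlparse

# Source URL as given on this page.
SOURCE_URL = "https://huggingface.co/datasets/CohenQu/deepscalar_RL_test_benchmark"


def repo_id_from_url(url: str) -> str:
    """Extract the 'owner/name' repository ID from a Hugging Face dataset URL.

    Hypothetical helper: expects a path of the form /datasets/<owner>/<name>.
    """
    parts = urlparse(url).path.strip("/").split("/")
    if len(parts) != 3 or parts[0] != "datasets":
        raise ValueError(f"not a Hugging Face dataset URL: {url!r}")
    return "/".join(parts[1:])


def load_benchmark(url: str = SOURCE_URL):
    """Download the dataset (requires `pip install datasets` and network access).

    Hypothetical helper. No split is requested, so `load_dataset` returns
    whatever splits the repository defines; inspect the result before use.
    """
    from datasets import load_dataset  # imported lazily so the helper above works offline

    return load_dataset(repo_id_from_url(url))


if __name__ == "__main__":
    print(repo_id_from_url(SOURCE_URL))
```

Calling `load_benchmark()` and printing the result shows the available splits and column names, which is a reasonable first step since the metric for this benchmark is listed only as "varies".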