dataset
ODA-Fin-RL-12k
OpenDataArena
or hover any field below to flag it
Overview
Name
ODA-Fin-RL-12k
Source
OpenDataArena
Episodes
0
Robot count
0
Format
parquet
Description
Unlocking Data Value in Finance: A Study on Distillation
and Difficulty-Aware Training
📖 Overview
ODA-Fin-RL-12K is a carefully curated dataset for reinforcement learning (RL) in financial domain, comprising 12,187 hard-but-verifiable samples. Designed to complement ODA-Fin-SFT-318K, this dataset targets challenging financial reasoning tasks with concise, reliably verifiable answers—optimized for RL training.
🎯 Key Highlights
12K Hard Samples: Curated… See the full description on the dataset page: https://huggingface.co/datasets/OpenDataArena/ODA-Fin-RL-12k.
Robots used
null
Links
HuggingFace dataset