dataset

ODA-Fin-RL-12k

OpenDataArena

Overview

Name
ODA-Fin-RL-12k
Source
OpenDataArena
Episodes
0
Robot count
0
Format
parquet
Description
Unlocking Data Value in Finance: A Study on Distillation and Difficulty-Aware Training
📖 Overview: ODA-Fin-RL-12K is a carefully curated dataset for reinforcement learning (RL) in the financial domain, comprising 12,187 hard-but-verifiable samples. Designed to complement ODA-Fin-SFT-318K, it targets challenging financial reasoning tasks whose answers are concise and reliably verifiable, which makes them well suited to RL training.
🎯 Key Highlights: 12K Hard Samples: Curated…
See the full description on the dataset page: https://huggingface.co/datasets/OpenDataArena/ODA-Fin-RL-12k.
Robots used
null

Links

HuggingFace dataset