dataset

ODA-Fin-RL-12k

OpenDataArena

or hover any field below to flag it

Overview

Name

ODA-Fin-RL-12k

Source

OpenDataArena

Episodes

Robot count

Format

parquet

Description

Unlocking Data Value in Finance: A Study on Distillation and Difficulty-Aware Training 📖 Overview ODA-Fin-RL-12K is a carefully curated dataset for reinforcement learning (RL) in financial domain, comprising 12,187 hard-but-verifiable samples. Designed to complement ODA-Fin-SFT-318K, this dataset targets challenging financial reasoning tasks with concise, reliably verifiable answers—optimized for RL training. 🎯 Key Highlights 12K Hard Samples: Curated… See the full description on the dataset page: https://huggingface.co/datasets/OpenDataArena/ODA-Fin-RL-12k.

Robots used

null

Links

HuggingFace dataset

OpenDataArena/ODA-Fin-RL-12k