dataset
SoundMind
xid32
Overview
Name: SoundMind
Source: xid32
Episodes: 0
Robot count: 0
Format: other
Description: We introduce the Audio Logical Reasoning (ALR) dataset, consisting of 6,446 text-audio annotated samples specifically designed for complex reasoning tasks. Building on this resource, we propose SoundMind, a rule-based reinforcement learning (RL) algorithm tailored to endow audio language models (ALM
Robots used: null
Links
HuggingFace dataset: null