dataset

BasicSpatialAbility

EmbodiedCity

or hover any field below to flag it

Overview

Name
BasicSpatialAbility
Source
EmbodiedCity
Episodes
0
Robot count
0
Format
other
Description
[ACL'25 Main] Defining and Evaluating Visual Language Models’ Basic Spatial Abilities: A Perspective from Psychometrics [!IMPORTANT] You can find the sample testing code on GitHub! This dataset is a benchmark designed for evaluating Multimodal Large Language Models' Basic Spatial Abilities based on authentic Psychometric theories. It is structured specifically to support both Zero-shot and Few-shot evaluation protocols. Split Name Role Description test Query Set… See the full description on the dataset page: https://huggingface.co/datasets/EmbodiedCity/BasicSpatialAbility.
Robots used
null

Links