dataset

BasicSpatialAbility

EmbodiedCity

or hover any field below to flag it

Overview

Name

BasicSpatialAbility

Source

EmbodiedCity

Episodes

Robot count

Format

other

Description

[ACL'25 Main] Defining and Evaluating Visual Language Models’ Basic Spatial Abilities: A Perspective from Psychometrics [!IMPORTANT] You can find the sample testing code on GitHub! This dataset is a benchmark designed for evaluating Multimodal Large Language Models' Basic Spatial Abilities based on authentic Psychometric theories. It is structured specifically to support both Zero-shot and Few-shot evaluation protocols. Split Name Role Description test Query Set… See the full description on the dataset page: https://huggingface.co/datasets/EmbodiedCity/BasicSpatialAbility.

Robots used

null

Links

HuggingFace dataset

EmbodiedCity/BasicSpatialAbility