dataset

UrbanVideo-Bench

EmbodiedCity

or hover any field below to flag it

Overview

Name
UrbanVideo-Bench
Source
EmbodiedCity
Episodes
0
Robot count
0
Format
parquet
Description
[ACL'25 Oral] UrbanVideo-Bench: Benchmarking Vision-Language Models on Embodied Intelligence with Video Data in Urban Spaces This repository contains the dataset introduced in the paper, consisting of two parts: 5k+ multiple-choice question-answering (MCQ) data and 1k+ video clips. Arxiv: https://arxiv.org/pdf/2503.06157 Project: https://embodiedcity.github.io/UrbanVideo-Bench/ Code: https://github.com/EmbodiedCity/UrbanVideo-Bench.code Dataset Description The… See the full description on the dataset page: https://huggingface.co/datasets/EmbodiedCity/UrbanVideo-Bench.
Robots used
null

Links