dataset

VPoS

ghazishazan

or hover any field below to flag it

Overview

Name
VPoS
Source
ghazishazan
Episodes
0
Robot count
0
Format
other
Description
VPoS-Bench: Video Pointing and Segmentation Benchmark VPoS-Bench is a challenging out-of-distribution benchmark designed to evaluate the spatio-temporal pointing and reasoning capabilities of video-language models. It covers a diverse set of five real-world application domains, with fine-grained point-level and segmentation annotations that enable robust evaluation of multimodal models under realistic, temporally complex scenarios. Webpage: VideoMolmo Paper: VideoMolmo:… See the full description on the dataset page: https://huggingface.co/datasets/ghazishazan/VPoS.
Robots used
null

Links

HuggingFace dataset