← Back to Benchmarks
simmediumroboticsmetric · varies

LongTail Driving Scenarios with Reasoning Traces: The KITScenes LongTail Dataset

Description

In real-world domains such as self-driving, generalization to rare scenarios remains a fundamental challenge. To address this, we introduce a new dataset designed for end-to-end driving that focuses on long-tail driving events. We provide multi-view video data, trajectories, high-level instructions, and detailed reasoning traces, facilitating in-context learning and few-shot generalization. The resulting benchmark for multimodal models, such as VLMs and VLAs, goes beyond safety and comfort metri

Source

http://arxiv.org/abs/2603.23607v2