← Back to Benchmarks
simmediumroboticsmetric · varies

DINO Pre-training for Vision-based End-to-end Autonomous Driving

Description

In this article, we focus on the pre-training of visual autonomous driving agents in the context of imitation learning. Current methods often rely on a classification-based pre-training, which we hypothesise to be holding back from extending capabilities of implicit image understanding. We propose pre-training the visual encoder of a driving agent using the self-distillation with no labels (DINO) method, which relies on a self-supervised learning paradigm.% and is trained on an unrelated task. O

Source

http://arxiv.org/abs/2407.10803v1