← Back to Benchmarks
simmediumroboticsmetric · varies
TransFuser: Imitation with Transformer-Based Sensor Fusion for Autonomous Driving
Description
How should we integrate representations from complementary sensors for autonomous driving? Geometry-based fusion has shown promise for perception (e.g. object detection, motion forecasting). However, in the context of end-to-end driving, we find that imitation learning based on existing sensor fusion methods underperforms in complex driving scenarios with a high density of dynamic agents. Therefore, we propose TransFuser, a mechanism to integrate image and LiDAR representations using self-attent