← Back to Benchmarks
simmediumroboticsmetric · varies

Pedestrian Crossing Intent Prediction via Psychological Features and Transformer Fusion

Description

Pedestrian intention prediction needs to be accurate for autonomous vehicles to navigate safely in urban environments. We present a lightweight, socially informed architecture for pedestrian intention prediction. It fuses four behavioral streams (attention, position, situation, and interaction) using highway encoders, a compact 4-token Transformer, and global self-attention pooling. To quantify uncertainty, we incorporate two complementary heads: a variational bottleneck whose KL divergence capt

Source

http://arxiv.org/abs/2603.19533v1