← Back to Benchmarks
simmediumrlmetric · varies
Real-Time Generative Policy via Langevin-Guided Flow Matching for Autonomous Driving
Description
Reinforcement learning (RL) is a fundamental methodology in autonomous driving systems, where generative policies exhibit considerable potential by leveraging their ability to model complex distributions to enhance exploration. However, their inherent high inference latency severely impedes their deployment in real-time decision-making and control. To address this issue, we propose diffusion actor-critic with entropy regulator via flow matching (DACER-F) by introducing flow matching into online