simmediumrlmetric · varies

Real-Time Generative Policy via Langevin-Guided Flow Matching for Autonomous Driving

Description

Reinforcement learning (RL) is a fundamental methodology in autonomous driving systems, where generative policies exhibit considerable potential by leveraging their ability to model complex distributions to enhance exploration. However, their inherent high inference latency severely impedes their deployment in real-time decision-making and control. To address this issue, we propose diffusion actor-critic with entropy regulator via flow matching (DACER-F) by introducing flow matching into online

Source

http://arxiv.org/abs/2603.02613v1