sim · medium · imitation · metric · varies

Learning Personalized Driving Styles via Reinforcement Learning from Human Feedback

Description

Generating human-like and adaptive trajectories is essential for autonomous driving in dynamic environments. While generative models have shown promise in synthesizing feasible trajectories, they often fail to capture the nuanced variability of personalized driving styles due to dataset biases and distributional shifts. To address this, we introduce TrajHF, a human feedback-driven finetuning framework for generative trajectory models, designed to align motion planning with diverse driving styles.

Source

http://arxiv.org/abs/2503.10434v2