← Back to Benchmarks
simmediumoffline-rlmetric · varies
Latent Embedding Adaptation for Human Preference Alignment in Diffusion Planners
Description
This work addresses the challenge of personalizing trajectories generated in automated decision-making systems by introducing a resource-efficient approach that enables rapid adaptation to individual users' preferences. Our method leverages a pretrained conditional diffusion model with Preference Latent Embeddings (PLE), trained on a large, reward-free offline dataset. The PLE serves as a compact representation for capturing specific user preferences. By adapting the pretrained model using our p