← Back to Benchmarks
simmediumoffline-rlmetric · varies

Latent Embedding Adaptation for Human Preference Alignment in Diffusion Planners

Description

This work addresses the challenge of personalizing trajectories generated in automated decision-making systems by introducing a resource-efficient approach that enables rapid adaptation to individual users' preferences. Our method leverages a pretrained conditional diffusion model with Preference Latent Embeddings (PLE), trained on a large, reward-free offline dataset. The PLE serves as a compact representation for capturing specific user preferences. By adapting the pretrained model using our p

Source

http://arxiv.org/abs/2503.18347v1