policy

LAMPO-Language-and-Preference-Conditioned-Reinforcement-Learning-Agent

FastReload · PyTorch


Overview

Name

LAMPO-Language-and-Preference-Conditioned-Reinforcement-Learning-Agent

Author

FastReload

Framework

PyTorch

License

MIT

Skill type

other

Evidence level

untested

Task description

LAMPO trains RL agents conditioned on language and preferences. It uses PPO with reward decomposition so that a single agent can exhibit flexible multi-objective behavior.
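As an illustration of how reward decomposition and preference conditioning can fit together, the sketch below combines per-objective reward components into one scalar using a preference weight vector. The catalog entry does not say how LAMPO combines its components, so the linear weighting, the function name `scalarize_reward`, and the objective names are all assumptions made for this example.

```python
from typing import Dict


def scalarize_reward(components: Dict[str, float],
                     preference: Dict[str, float]) -> float:
    """Collapse decomposed rewards into one scalar (hypothetical helper).

    Assumes linear scalarization: each reward component is multiplied by
    its normalized preference weight and the products are summed.
    """
    total_weight = sum(preference.values())
    if total_weight <= 0:
        raise ValueError("preference weights must sum to a positive value")
    # Normalize so the weights form a convex combination; objectives
    # without a stated preference contribute nothing.
    return sum(
        preference.get(name, 0.0) / total_weight * value
        for name, value in components.items()
    )


# Illustrative objective names and values, not taken from the LAMPO repository.
step_reward = {"speed": 1.0, "safety": -0.5, "energy": 0.2}
user_preference = {"speed": 2.0, "safety": 1.0, "energy": 1.0}
scalar = scalarize_reward(step_reward, user_preference)
```

Under this assumption, a PPO update would use the scalar as its reward signal, and changing the preference vector changes the trade-off without redefining the individual components.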

Action space

other · 0-dim · 0Hz

Observation space

HuggingFace repo

null

Paper (arXiv)

null

3+17 mentioned but not in catalog yet

No environments list LAMPO-Language-and-Preference-Conditioned-Reinforcement-Learning-Agent yet.

No datasets reference LAMPO-Language-and-Preference-Conditioned-Reinforcement-Learning-Agent yet.