policy

2DLunarLanderPPO

WhiteMetagross · PyTorch

or hover any field below to flag it

Overview

Name

2DLunarLanderPPO

Author

WhiteMetagross

Framework

PyTorch

License

unknown

Skill type

other

Evidence level

untested

Task description

This projects use PPO RL algorithm to train a model for the Lunar Lander Continuous v3 enviroment, with comparisons between Manual-LLM tunded hyperparameters and Optuna tuned hyperparameters.

Spaces

Action space

other · 0-dim · 0Hz

Observation space

type: other

Links

HuggingFace repo

null

Paper (arXiv)

null

Compatible robots

3+17 mentioned but not in catalog yet

SpotBoston Dynamics T1Booster Robotics ApolloApptronik

Compatible environments

No environments list 2DLunarLanderPPO yet.

Datasets that reference this policy

No datasets reference 2DLunarLanderPPO yet.