policy

2DLunarLanderPPO

WhiteMetagross · PyTorch

Overview

Name
2DLunarLanderPPO
Author
WhiteMetagross
Framework
PyTorch
License
unknown
Skill type
other
Evidence level
untested
Task description
This project uses the PPO reinforcement learning algorithm to train a model for the Lunar Lander Continuous v3 environment, and compares Manual-LLM-tuned hyperparameters against Optuna-tuned hyperparameters.
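
For readers unfamiliar with PPO, the clipped surrogate objective at its core can be sketched in a few lines of plain Python. This is a minimal illustration and is not taken from the project's code: the function name `ppo_clipped_objective` and the default `clip_range=0.2` are assumptions (0.2 is a commonly used value, not one confirmed for this policy).

```python
def ppo_clipped_objective(ratios, advantages, clip_range=0.2):
    """Mean PPO clipped surrogate objective over a batch (to be maximized).

    ratios:     new_policy_prob / old_policy_prob for each sampled action
    advantages: advantage estimate for each sampled action
    clip_range: hypothetical default; the project's actual value is unknown
    """
    if len(ratios) != len(advantages):
        raise ValueError("ratios and advantages must have the same length")
    if not ratios:
        raise ValueError("batch must not be empty")

    total = 0.0
    for ratio, adv in zip(ratios, advantages):
        # Restrict the probability ratio to [1 - clip_range, 1 + clip_range].
        clipped = max(1.0 - clip_range, min(ratio, 1.0 + clip_range))
        # Take the pessimistic (smaller) of the unclipped and clipped terms.
        total += min(ratio * adv, clipped * adv)
    return total / len(ratios)
```

The clip range is the kind of hyperparameter that a manual or automated search would typically vary, alongside values such as the learning rate and the discount factor.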

Spaces

Action space
other · 0-dim · 0Hz
Observation space
  • type: other

Links

HuggingFace repo
null
Paper (arXiv)
null

Compatible robots

3+17 mentioned but not in catalog yet

Compatible environments

0

No environments list 2DLunarLanderPPO yet.

Datasets that reference this policy

0

No datasets reference 2DLunarLanderPPO yet.