policy

DRL

inadk · PyTorch

or hover any field below to flag it

Overview

Name
DRL
Author
inadk
Framework
PyTorch
License
unknown
Skill type
locomotion
Evidence level
untested
Task description
Tested the generalization of trained policies to different environment dynamics using the Hopper-v4 environment with the PPO model. Compared on and off policy RL algorithms, namely DDPG, PPO and TD3 in the BipedalWalker-v3 environment.

Spaces

Action space
other · 0-dim · 0Hz
Observation space
  • type: other

Links

HuggingFace repo
null
Paper (arXiv)
null

Compatible robots

3+17 mentioned but not in catalog yet

Compatible environments

0

No environments list DRL yet.

Datasets that reference this policy

0

No datasets reference DRL yet.