policy
DRL
inadk · PyTorch
or hover any field below to flag it
Overview
Name
DRL
Author
inadk
Framework
PyTorch
License
unknown
Skill type
locomotion
Evidence level
untested
Task description
Tested the generalization of trained policies to different environment dynamics using the Hopper-v4 environment with the PPO model. Compared on and off policy RL algorithms, namely DDPG, PPO and TD3 in the BipedalWalker-v3 environment.
Spaces
Action space
other · 0-dim · 0Hz
Observation space
- type: other
Links
HuggingFace repo
null
Paper (arXiv)
null
Compatible robots
3+17 mentioned but not in catalog yetCompatible environments
0No environments list DRL yet.
Datasets that reference this policy
0No datasets reference DRL yet.