policy

DRL_Project_TRPO

ankitdipto · PyTorch

or hover any field below to flag it

Overview

Name

DRL_Project_TRPO

Author

ankitdipto

Framework

PyTorch

License

unknown

Skill type

locomotion

Evidence level

untested

Task description

Implmentation of Trust Region Policy Optimization (TRPO) from scratch. Tested on standard mujoco-based gymnasium environments. Extended to a harder task of Quadrupedal Locomotion with the help of gait priors.

Spaces

Action space

other · 0-dim · 0Hz

Observation space

type: other

Links

HuggingFace repo

null

Paper (arXiv)

null

Compatible robots

3+17 mentioned but not in catalog yet

SpotBoston Dynamics T1Booster Robotics ApolloApptronik

Compatible environments

No environments list DRL_Project_TRPO yet.

Datasets that reference this policy

No datasets reference DRL_Project_TRPO yet.