policy

DRL_Project_TRPO

ankitdipto · PyTorch

or hover any field below to flag it

Overview

Name

DRL_Project_TRPO

Author

ankitdipto

Framework

PyTorch

License

unknown

Skill type

locomotion

Evidence level

untested

Task description

Implmentation of Trust Region Policy Optimization (TRPO) from scratch. Tested on standard mujoco-based gymnasium environments. Extended to a harder task of Quadrupedal Locomotion with the help of gait priors.

Spaces

Action space

other · 0-dim · 0Hz

Observation space

type: other

Links

HuggingFace repo

null

Paper (arXiv)

null

Compatible environments

No environments list DRL_Project_TRPO yet.

Datasets that reference this policy

No datasets reference DRL_Project_TRPO yet.

Overview

Spaces

Links

Compatible robots

Compatible environments

Datasets that reference this policy