policy

RLCoPilot

nicolas-rabault · PyTorch

or hover any field below to flag it

Overview

Name

RLCoPilot

Author

nicolas-rabault

Framework

PyTorch

License

MIT

Skill type

other

Evidence level

untested

Task description

Autonomous RL training copilot for Claude Code. Tell Claude "train the robot to run faster" — it writes the code, launches training on your GPU server, monitors metrics via WandB, evaluates policies, and iterates on failures. Fully autonomous training loops with multi-agent orchestration.

Spaces

Action space

other · 0-dim · 0Hz

Observation space

type: other

Links

HuggingFace repo

null

Paper (arXiv)

null

Compatible environments

No environments list RLCoPilot yet.

Datasets that reference this policy

No datasets reference RLCoPilot yet.

Overview

Spaces

Links

Compatible robots

Compatible environments

Datasets that reference this policy