policy

hyper-efficient-rl

yus100 · PyTorch

Overview

Name

hyper-efficient-rl

Author

yus100

Framework

PyTorch

License

unknown

Skill type

other

Evidence level

untested

Task description

Resource-efficient RL fine-tuning of LLMs with length-aware context optimization and online curriculum learning (SPEED)

Action space

other · 0-dim · 0Hz

Observation space

HuggingFace repo

null

Paper (arXiv)

null

No environments list hyper-efficient-rl yet.

No datasets reference hyper-efficient-rl yet.