policy
RewardModelOnlyOnAnswer
RajuEEE · PyTorch
Overview
Name
RewardModelOnlyOnAnswer
Author
RajuEEE
Framework
PyTorch
License
unknown
Skill type
other
Evidence level
untested
Task description
Policy model RewardModelOnlyOnAnswer by RajuEEE.
Spaces
Action space
other · 0-dim · 0 Hz
Observation space
- type: other
Links
HuggingFace repo
Paper (arXiv)
null
Compatible robots
3 (+17 mentioned but not in catalog yet)
Compatible environments
0
No environments list RewardModelOnlyOnAnswer yet.
Datasets that reference this policy
0
No datasets reference RewardModelOnlyOnAnswer yet.