policy

LLM-fine-tuner

Yog-Sotho · PyTorch

Overview

Name
LLM-fine-tuner
Author
Yog-Sotho
Framework
PyTorch
License
GPL-3.0
Skill type
other
Evidence level
untested
Task description
No-code LLM fine-tuner: upload data → train → deploy, advertised as taking minutes. Listed features: Unsloth acceleration (claimed 2-5×) · QLoRA/DPO/RLHF/PPO/ORPO · Reward Model training · GGUF export · vLLM inference · BLEU/ROUGE/BERTScore evaluation · full CLI · "Heretic Mode", described by the author as unlocking full model potential

Spaces

Action space
other · 0-dim · 0Hz
Observation space
  • type: other

Links

HuggingFace repo
Not provided
Paper (arXiv)
Not provided

Compatible robots

3+17 mentioned but not in catalog yet

Compatible environments

0

No environments list LLM-fine-tuner yet.

Datasets that reference this policy

0

No datasets reference LLM-fine-tuner yet.