forked from huggingface/trl

for when your LLM is bad and you need to reinforce it to be a good robot ^~^

allura-org/clicker

 
 

Clicker

for when your LLM is bad and you need to reinforce it to be a good robot ^~^

okay, but what is it really

a smol fork of TRL that vendors the trainers we need at allura and gets rid of some of the unneeded things we don't!

included trainers (recommended):

  • SFTTrainer
  • DPOTrainer + variants
  • KTOTrainer
  • RLOOTrainer
  • GRPOTrainer + variants

you need nothing else.
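
a quick way to check which of the trainers listed above your install actually exposes. this is only a sketch: it assumes the fork is still importable under the upstream module name `trl`, which this README does not confirm (pass a different `module_name` if yours differs), and `available_trainers` is a hypothetical helper, not something the package ships.

```python
import importlib

# Trainer names copied from the "included trainers" list above.
RECOMMENDED_TRAINERS = [
    "SFTTrainer",
    "DPOTrainer",
    "KTOTrainer",
    "RLOOTrainer",
    "GRPOTrainer",
]


def available_trainers(module_name="trl"):
    """Map each recommended trainer name to whether `module_name` exposes it.

    `module_name="trl"` is an ASSUMPTION (the upstream import name); the fork
    may install under a different name. Returns {} if the module is missing.
    """
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        return {}

    found = {}
    for name in RECOMMENDED_TRAINERS:
        try:
            found[name] = hasattr(module, name)
        except Exception:
            # A lazily loaded attribute may fail to import its dependencies;
            # treat that as "not usable" rather than crashing the check.
            found[name] = False
    return found


if __name__ == "__main__":
    report = available_trainers()
    if not report:
        print("module not importable; is the package installed?")
    for name, ok in report.items():
        print(f"{name}: {'ok' if ok else 'missing'}")
```

the check only looks at attribute names, so it tells you a trainer is present, not that it is configured correctly for your model.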

notes

there may still be remnants of old docs, names, etc!
