PolyRL is reinforcement learning (RL) framework for large language models (LLM) that leverage polymorphic resources to maximize cost efficiency. Our goal is to create a portable and affordable fine-tuning platform accessible to everyone.
Please refer to ROADMAP.md for the development roadmap of PolyRL.
Please refer to INSTALL.md for installation instructions.
Please refer to USAGE.md for usage instructions.