Summary
"TRL: SFT, DPO, PPO, GRPO, reward modeling for LLM RLHF."
Audiocraft
Built-in
Huggingface Hub
Llama Cpp