Collections

Discover the best community collections!

Collections including paper arxiv:2403.07691
About ORPO
Contains some information and experiments fine-tuning LLMs using 🤗 `trl.ORPOTrainer`
RLHF
Collection by 11 days ago
Papers - Fine-tuning
Collection by 30 days ago
ORPO
This is the official collection of "ORPO: Monolithic Preference Optimization without Reference Model".
Training
Collection by Dec 11, 2024
RLHF
Collection by Mar 19, 2024
Papers
Large Language Model (LLM) and NLP related papers.
NLP paper
Collection by Jun 11, 2024