Collections

Discover the best community collections!

Collections including paper arxiv:1909.08593
Papers - Reward Model
Collection by Apr 19, 2024
Papers - OpenAI
Collection by Jun 12, 2024
Papers - Fine-tuning - DPO
Refer to additional papers: https://link.springer.com/article/10.1007/s10994-014-5458-8 and https://link.springer.com/article/10.1007/BF00992696
Papers - Training - Reward Model
Collection by Mar 29, 2024