gemma2-gutenberg-9B / README.md
nbeerbower's picture
Update README.md
ebdab2d verified
|
raw
history blame
545 Bytes
metadata
library_name: transformers
base_model:
  - UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3
datasets:
  - jondurbin/gutenberg-dpo-v0.1
license: gemma

gemma2-gutenberg-9B

UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3 finetuned on jondurbin/gutenberg-dpo-v0.1.

Method

Finetuned using an RTX 4090 using ORPO for 3 epochs.

Fine-tune Llama 3 with ORPO