yusufcakmak
committed on
Fixed typo
README.md
CHANGED
@@ -20,7 +20,7 @@ Trendyol LLM v1.0 - DPO is a generative model that is based on Mistral 7B model.
 
 **Output** Models generate text only.
 
-**Model Architecture** Trendyol LLM is an auto-regressive language model (based on Mistral 7b) that uses an optimized transformer architecture. The
+**Model Architecture** Trendyol LLM is an auto-regressive language model (based on Mistral 7b) that uses an optimized transformer architecture. The DPO version is fine-tuned on 11K sets (prompt-chosen-reject) with the following trainables by using LoRA:
 
 - **lr**=5e-6
 - **lora_rank**=64
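
For readers unfamiliar with the `lora_rank` setting in the changed paragraph, here is a minimal sketch of what a rank of 64 means in terms of trainable parameters. It is not the authors' training code: the class `DpoLoraSettings`, the function `lora_param_count`, and the 4096 x 4096 matrix size are hypothetical and chosen only for illustration. The README hunk shows only `lr` and `lora_rank`; which layers were adapted is not stated here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DpoLoraSettings:
    """Only the two hyperparameters visible in this hunk (hypothetical container)."""
    lr: float = 5e-6      # value taken from the README
    lora_rank: int = 64   # value taken from the README


def lora_param_count(d_in: int, d_out: int, rank: int) -> int:
    """Trainable parameters LoRA adds to one (d_out x d_in) weight matrix.

    LoRA keeps W frozen and learns a low-rank update B @ A, where
    A has shape (rank, d_in) and B has shape (d_out, rank).
    """
    return rank * d_in + d_out * rank


settings = DpoLoraSettings()

# Assumption for illustration only: a square 4096 x 4096 projection matrix.
added = lora_param_count(4096, 4096, settings.lora_rank)
full = 4096 * 4096

print(added, full, added / full)
```

Under these assumptions the low-rank update trains 524,288 parameters for a matrix that holds 16,777,216, about 3% of the full matrix, which is why LoRA is commonly paired with small learning rates and modest hardware.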