Update README.md
README.md
CHANGED
@@ -23,7 +23,7 @@ Colab: [link](https://colab.research.google.com/drive/1qxgIPymzW6_H6s_wwXu3lknkk
 
 I've changed the prompt format from ChatML to **the original Llama-3 format in v4**. Don't forget to switch formats!
 
-**v4+**: Llama-3 prompt format:
+**v4, v5, v6+**: Llama-3 prompt format:
 ```
 <|begin_of_text|><|start_header_id|>system<|end_header_id|>
 
@@ -118,6 +118,16 @@ Saiga: In a small village on the outskirts of the city
 
 
 ## Versions
+v6:
+- [b662833f247ca04f1843b356e7ff3ee4aef8086a](https://huggingface.co/IlyaGusev/saiga_llama3_8b/commit/b662833f247ca04f1843b356e7ff3ee4aef8086a)
+- Other name: saiga_llama3_8b_sft_m10_d1_kto_m2_d2
+- SFT dataset config: [sft_d1.json](https://github.com/IlyaGusev/saiga/blob/main/configs/datasets/sft_d1.json)
+- SFT model config: [saiga_llama3_8b_sft_m10.json](https://github.com/IlyaGusev/saiga/blob/main/configs/models/saiga_llama3_8b_sft_m10.json)
+- KTO dataset config: [pref_d2.json](https://github.com/IlyaGusev/saiga/blob/main/configs/datasets/pref_d2.json)
+- KTO model config: [saiga_llama3_8b_kto_m2.json](https://github.com/IlyaGusev/saiga/blob/main/configs/models/saiga_llama3_8b_kto_m2.json)
+- SFT wandb: [link](https://wandb.ai/ilyagusev/rulm_self_instruct/runs/0iepauzu)
+- KTO wandb: [link](https://wandb.ai/ilyagusev/rulm_self_instruct/runs/s6l98eot)
+
 v5:
 - [d947b00c56683cd4b2f7ce707edef89318027be4](https://huggingface.co/IlyaGusev/saiga_llama3_8b/commit/d947b00c56683cd4b2f7ce707edef89318027be4)
 - KTO-tune over v4, dataset: [lmsys_clean_ru_preferences](https://huggingface.co/datasets/IlyaGusev/lmsys_clean_ru_preferences)
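
The README note in this diff asks users to switch from ChatML to the Llama-3 prompt format, but the excerpt shows only the first line of that format. The Python sketch below fills in the rest from the standard Llama-3 chat layout, on the assumption that Saiga v4+ uses it unchanged. The helper name `build_llama3_prompt` is hypothetical and is not part of the Saiga repository.

```python
# Minimal sketch of the standard Llama-3 chat layout.
# Assumption: Saiga v4+ follows this layout without modification.
# `build_llama3_prompt` is a hypothetical helper, not a Saiga API.

def build_llama3_prompt(messages):
    """Render a list of {"role", "content"} dicts as a Llama-3 style prompt."""
    prompt = "<|begin_of_text|>"
    for message in messages:
        # Each turn is a role header, a blank line, the text, then <|eot_id|>.
        prompt += f"<|start_header_id|>{message['role']}<|end_header_id|>\n\n"
        prompt += f"{message['content']}<|eot_id|>"
    # Open an assistant turn so the model generates the reply from here.
    prompt += "<|start_header_id|>assistant<|end_header_id|>\n\n"
    return prompt


if __name__ == "__main__":
    demo = build_llama3_prompt([
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Hello!"},
    ])
    print(demo)
```

In practice, the chat template shipped with the model's tokenizer (for example via `tokenizer.apply_chat_template` in Hugging Face Transformers) is likely the safer choice, because it tracks whatever format the published checkpoint actually expects.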