Update README.md
README.md
CHANGED
@@ -15,6 +15,16 @@ This repo contains a GGUF Quantized versions of the SFR-Iterative-DPO-Llama-3-8B
 weights from:
 [maldv/SFR-Iterative-DPO-LLaMA-3-8B-R](https://huggingface.co/maldv/SFR-Iterative-DPO-LLaMA-3-8B-R)
 
+## Prompt format
+
+```
+<|start_header_id|>system<|end_header_id|>
+{system_prompt}<|eot_id|>
+<|start_header_id|>user<|end_header_id|>
+{prompt}<|eot_id|>
+<|start_header_id|>assistant<|end_header_id|>
+
+
 ## Introduction
 We release a state-of-the-art instruct model of its class, **SFR-Iterative-DPO-LLaMA-3-8B-R**.
 On all three widely-used instruct model benchmarks: **Alpaca-Eval-V2**, **MT-Bench**, **Chat-Arena-Hard**, our model outperforms all models of similar size (e.g., LLaMA-3-8B-it), most large open-sourced models (e.g., Mixtral-8x7B-it),
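The prompt format added in this change can be filled in programmatically. The following is a minimal sketch, not part of the README: the helper name `build_prompt` and the default system prompt are hypothetical, and the newline placement simply mirrors the template as it appears in the diff (one line per header, one trailing blank line after the assistant header). Whether a given runtime also expects a leading beginning-of-text token or different whitespace is not stated in this change, so treat those details as assumptions to verify against your loader.

```python
# Hypothetical helper: fills the prompt template shown in this diff.
# The line breaks copy the README snippet literally; they are an assumption,
# not a confirmed requirement of the model or of any GGUF runtime.
PROMPT_TEMPLATE = (
    "<|start_header_id|>system<|end_header_id|>\n"
    "{system_prompt}<|eot_id|>\n"
    "<|start_header_id|>user<|end_header_id|>\n"
    "{prompt}<|eot_id|>\n"
    "<|start_header_id|>assistant<|end_header_id|>\n"
    "\n"
)


def build_prompt(prompt: str, system_prompt: str = "You are a helpful assistant.") -> str:
    """Return the template with both placeholders substituted.

    The default system prompt is illustrative only.
    """
    return PROMPT_TEMPLATE.format(system_prompt=system_prompt, prompt=prompt)


if __name__ == "__main__":
    # Print a filled-in prompt that could be passed to a text-completion call.
    print(build_prompt("What is GGUF?"))
```

The string ends right after the assistant header so that the model's completion becomes the assistant turn; generation would normally stop when the model emits `<|eot_id|>`.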