markoarnauto
commited on
Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
@@ -7,14 +7,13 @@ This is a quantization of the [phi-4](https://huggingface.co/microsoft/phi-4).
|
|
7 |
|
8 |
The phi-4 model is a cutting-edge open-source LLM developed using a diverse mix of synthetic datasets, curated public domain web content, and acquired academic resources, including books and Q&A datasets. This deliberate data selection ensures the training of compact yet highly capable models with an emphasis on quality and advanced reasoning. To further enhance its performance, phi-4 underwent a rigorous alignment process that included supervised fine-tuning and direct preference optimization, resulting in precise instruction adherence and robust safety measures.
|
9 |
## Evaluations
|
10 |
-
This model provides an accuracy recovery of 99.
|
11 |
|
12 |
| __English__ | __[phi-4](https://huggingface.co/microsoft/phi-4)__ | __[phi-4-FP8-Dynamic (this)](https://huggingface.co/cortecs/phi-4-FP8-Dynamic)__ |
|
13 |
|:--------------|:------------------------------------------------------|:-----------------------------------------------------------------------------------|
|
14 |
| Avg. | 70.75 | 70.7 |
|
15 |
| Arc | 68.7 | 68.7 |
|
16 |
| Hellaswag | 72.8 | 72.7 |
|
17 |
-
| MMLU | 79.46 | 79.67 |
|
18 |
| | | |
|
19 |
| __French__ | __[phi-4](https://huggingface.co/microsoft/phi-4)__ | __[phi-4-FP8-Dynamic (this)](https://huggingface.co/cortecs/phi-4-FP8-Dynamic)__ |
|
20 |
| Avg. | 68.67 | 68.87 |
|
@@ -48,7 +47,7 @@ Install **vLLM** and
|
|
48 |
run the [server](https://docs.vllm.ai/en/latest/serving/openai_compatible_server.html#openai-compatible-server):
|
49 |
|
50 |
```
|
51 |
-
python -m vllm.entrypoints.openai.api_server --model cortecs/phi-4-FP8-Dynamic
|
52 |
```
|
53 |
Access the model:
|
54 |
```
|
|
|
7 |
|
8 |
The phi-4 model is a cutting-edge open-source LLM developed using a diverse mix of synthetic datasets, curated public domain web content, and acquired academic resources, including books and Q&A datasets. This deliberate data selection ensures the training of compact yet highly capable models with an emphasis on quality and advanced reasoning. To further enhance its performance, phi-4 underwent a rigorous alignment process that included supervised fine-tuning and direct preference optimization, resulting in precise instruction adherence and robust safety measures.
|
9 |
## Evaluations
|
10 |
+
This model provides an accuracy recovery of 99.68%.
|
11 |
|
12 |
| __English__ | __[phi-4](https://huggingface.co/microsoft/phi-4)__ | __[phi-4-FP8-Dynamic (this)](https://huggingface.co/cortecs/phi-4-FP8-Dynamic)__ |
|
13 |
|:--------------|:------------------------------------------------------|:-----------------------------------------------------------------------------------|
|
14 |
| Avg. | 70.75 | 70.7 |
|
15 |
| Arc | 68.7 | 68.7 |
|
16 |
| Hellaswag | 72.8 | 72.7 |
|
|
|
17 |
| | | |
|
18 |
| __French__ | __[phi-4](https://huggingface.co/microsoft/phi-4)__ | __[phi-4-FP8-Dynamic (this)](https://huggingface.co/cortecs/phi-4-FP8-Dynamic)__ |
|
19 |
| Avg. | 68.67 | 68.87 |
|
|
|
47 |
run the [server](https://docs.vllm.ai/en/latest/serving/openai_compatible_server.html#openai-compatible-server):
|
48 |
|
49 |
```
|
50 |
+
python -m vllm.entrypoints.openai.api_server --model cortecs/phi-4-FP8-Dynamic --max-model-len 16384
|
51 |
```
|
52 |
Access the model:
|
53 |
```
|