Commit 1ff7a22 (1 parent: 9f2abb4), committed by nintwentydo
Update README.md

README.md CHANGED
@@ -35,4 +35,6 @@ Example VLLM usage
 vllm serve nintwentydo/pixtral-12b-FP8-dynamic-FP8-KV-cache --quantization fp8 --kv-cache-dtype fp8
 ```
 
-Supported on Nvidia GPUs with compute capability > 8.9 (Ada Lovelace, Hopper).
+Supported on Nvidia GPUs with compute capability > 8.9 (Ada Lovelace, Hopper).
+
+**Edit:** Something seems to be wrong with the tokenizer. If you have any issues add `--tokenizer mistral-community/pixtral-12b` to your VLLM command line args.
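
For reference, the workaround in the added line can be sketched as a full command. This is only a sketch: it assumes the flags from the README's existing example stay unchanged and that `--tokenizer` is simply appended. The variable names `MODEL`, `TOKENIZER`, and `CMD` are illustrative and do not come from the README.

```shell
# Model repo and tokenizer override, both taken from the README text in this commit.
MODEL="nintwentydo/pixtral-12b-FP8-dynamic-FP8-KV-cache"
TOKENIZER="mistral-community/pixtral-12b"

# Assemble the serve command with the tokenizer override appended (assumed placement).
CMD="vllm serve $MODEL --quantization fp8 --kv-cache-dtype fp8 --tokenizer $TOKENIZER"

# Print the command for inspection; paste the printed line into a shell to launch the server.
echo "$CMD"
```

Printing the command rather than launching it directly makes it easy to confirm the tokenizer override is present before starting a long-running server.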