nintwentydo commited on
Commit
1ff7a22
1 Parent(s): 9f2abb4

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -1
README.md CHANGED
@@ -35,4 +35,6 @@ Example VLLM usage
35
  vllm serve nintwentydo/pixtral-12b-FP8-dynamic-FP8-KV-cache --quantization fp8 --kv-cache-dtype fp8
36
  ```
37
 
38
- Supported on Nvidia GPUs with compute capability > 8.9 (Ada Lovelace, Hopper).
 
 
 
35
  vllm serve nintwentydo/pixtral-12b-FP8-dynamic-FP8-KV-cache --quantization fp8 --kv-cache-dtype fp8
36
  ```
37
 
38
+ Supported on Nvidia GPUs with compute capability > 8.9 (Ada Lovelace, Hopper).
39
+
40
+ **Edit:** Something seems to be wrong with the tokenizer. If you have any issues add `--tokenizer mistral-community/pixtral-12b` to your VLLM command line args.