juntaoyuan
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -32,13 +32,15 @@ tags:
|
|
32 |
|
33 |
- Context size: `384`
|
34 |
|
|
|
|
|
35 |
- Run as LlamaEdge service
|
36 |
|
37 |
```bash
|
38 |
wasmedge --dir .:. --nn-preload default:GGML:AUTO:all-MiniLM-L6-v2-ggml-model-f16.gguf \
|
39 |
llama-api-server.wasm \
|
40 |
--prompt-template llama-2-chat \
|
41 |
-
--ctx-size
|
42 |
--model-name all-MiniLM-L6-v2
|
43 |
```
|
44 |
|
|
|
32 |
|
33 |
- Context size: `384`
|
34 |
|
35 |
+
- Vector size: `256`
|
36 |
+
|
37 |
- Run as LlamaEdge service
|
38 |
|
39 |
```bash
|
40 |
wasmedge --dir .:. --nn-preload default:GGML:AUTO:all-MiniLM-L6-v2-ggml-model-f16.gguf \
|
41 |
llama-api-server.wasm \
|
42 |
--prompt-template llama-2-chat \
|
43 |
+
--ctx-size 256 \
|
44 |
--model-name all-MiniLM-L6-v2
|
45 |
```
|
46 |
|