shaowenchen committed on
Commit · 35f3218
1 Parent(s): 641931a
add ./vicuna-7b-v1.5-16k.Q4_K.gguf
- README.md +3 -1
- vicuna-7b-v1.5-16k.Q4_K.gguf +3 -0
README.md
CHANGED
@@ -40,10 +40,12 @@ tags:
 
 Usage:
 
-```
+```
 docker run --rm -it -p 8000:8000 -v /path/to/models:/models -e MODEL=/models/gguf-model-name.gguf hubimage/llama-cpp-python:latest
 ```
 
+and you can view http://localhost:8000/docs to see the swagger UI.
+
 ## Provided images
 
 | Name | Quant method | Size |
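Once the container from the usage line is running, the server can also be called from a script rather than through the Swagger UI. The following is a minimal sketch, not the image's documented client: it assumes the image runs the llama-cpp-python web server and that it exposes an OpenAI-style `/v1/completions` route (the `/docs` page lists the routes that actually exist). The helper names `build_completion_request` and `complete` are hypothetical.

```python
import json
import urllib.request

# Port published by `-p 8000:8000` in the usage line.
BASE_URL = "http://localhost:8000"


def build_completion_request(prompt, max_tokens=64, base_url=BASE_URL):
    """Build (but do not send) a POST request for the completions route."""
    payload = {"prompt": prompt, "max_tokens": max_tokens}
    return urllib.request.Request(
        url=f"{base_url}/v1/completions",  # assumed OpenAI-style route
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def complete(prompt, **kwargs):
    """Send the request and return the decoded JSON response body."""
    request = build_completion_request(prompt, **kwargs)
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

With the container up, `complete("Hello", max_tokens=16)` should return a JSON object; if the route assumption holds, the generated text sits under `["choices"][0]["text"]`.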
vicuna-7b-v1.5-16k.Q4_K.gguf
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:706312090f71d94c2a04f2d7d2a238f2b3562618283090e26208bd20645ced84
+size 4081004256
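The three added lines are a Git LFS pointer, not the model itself: they record the SHA-256 digest and byte size of the real `.gguf` file. As a rough sketch of how a downloaded file could be checked against such a pointer, the helpers below parse the pointer text and compare size and digest. The function names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the local file path is whatever you downloaded the model to.

```python
import hashlib
import os


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the pointer's size and SHA-256."""
    if os.path.getsize(path) != pointer["size"]:
        return False  # cheap check first, avoids hashing a truncated download
    hasher = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read in chunks so a multi-gigabyte model is never held in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer contents copied from this commit.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:706312090f71d94c2a04f2d7d2a238f2b3562618283090e26208bd20645ced84
size 4081004256
"""
```

For example, `matches_pointer("/path/to/models/vicuna-7b-v1.5-16k.Q4_K.gguf", parse_lfs_pointer(POINTER))` would confirm that a local copy is complete and uncorrupted.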