Update README.md
Browse files
README.md
CHANGED
@@ -35,6 +35,12 @@ chmod +x TriLM_3.9B.llamafile
|
|
35 |
./TriLM_3.9B.llamafile
|
36 |
```
|
37 |
|
|
|
|
|
|
|
|
|
|
|
|
|
38 |
For further information, please see the [llamafile
|
39 |
README](https://github.com/mozilla-ocho/llamafile/).
|
40 |
|
@@ -48,7 +54,9 @@ This is a base model. It hasn't been fine-tuned for chat. It's
|
|
48 |
recommended that the completions interface be used.
|
49 |
|
50 |
It's recommended with the smaller TriLM models (e.g. 99M) that a high
|
51 |
-
repeat penalty be set, e.g. `--repeat-penalty 10`.
|
|
|
|
|
52 |
|
53 |
## Benchmarks
|
54 |
|
|
|
35 |
./TriLM_3.9B.llamafile
|
36 |
```
|
37 |
|
38 |
+
You can also use the command line interface:
|
39 |
+
|
40 |
+
```
|
41 |
+
./TriLM_3.9B.llamafile -p "this is my prompt"
|
42 |
+
```
|
43 |
+
|
44 |
For further information, please see the [llamafile
|
45 |
README](https://github.com/mozilla-ocho/llamafile/).
|
46 |
|
|
|
54 |
recommended that the completions interface be used.
|
55 |
|
56 |
It's recommended with the smaller TriLM models (e.g. 99M) that a high
|
57 |
+
repeat penalty be set, e.g. `--repeat-penalty 10`. When using the CLI
|
58 |
+
mode, this flag is specified by default in the `.args` file embedded
|
59 |
+
within the llamafiles from this repo.
|
60 |
|
61 |
## Benchmarks
|
62 |
|