Spaces:
Build error
Build error
Tom Aarsen
commited on
Commit
·
625b359
1
Parent(s):
6236065
a int8 -> an int8
Browse files
app.py
CHANGED
@@ -82,7 +82,7 @@ Details:
|
|
82 |
1. The query is embedded using the [`mixedbread-ai/mxbai-embed-large-v1`](https://huggingface.co/mixedbread-ai/mxbai-embed-large-v1) SentenceTransformer model.
|
83 |
2. The query is quantized to binary using the `quantize_embeddings` function from the SentenceTransformers library.
|
84 |
3. A binary index (1M binary embeddings; 130MB of memory/disk space) is searched using the quantized query for the top 40 documents.
|
85 |
-
4. The top 40 documents are loaded on the fly from
|
86 |
5. The top 40 documents are rescored using the float32 query and the int8 embeddings to get the top 10 documents.
|
87 |
6. The top 10 documents are sorted by score and displayed.
|
88 |
|
|
|
82 |
1. The query is embedded using the [`mixedbread-ai/mxbai-embed-large-v1`](https://huggingface.co/mixedbread-ai/mxbai-embed-large-v1) SentenceTransformer model.
|
83 |
2. The query is quantized to binary using the `quantize_embeddings` function from the SentenceTransformers library.
|
84 |
3. A binary index (1M binary embeddings; 130MB of memory/disk space) is searched using the quantized query for the top 40 documents.
|
85 |
+
4. The top 40 documents are loaded on the fly from an int8 index on disk (1M int8 embeddings; 0 bytes of memory, 1.19GB of disk space).
|
86 |
5. The top 40 documents are rescored using the float32 query and the int8 embeddings to get the top 10 documents.
|
87 |
6. The top 10 documents are sorted by score and displayed.
|
88 |
|