Update README.md
README.md
CHANGED
@@ -1,3 +1,14 @@
+---
+license: apache-2.0
+language:
+- en
+base_model:
+- Qwen/QwQ-32B-Preview
+pipeline_tag: text-generation
+tags:
+- chat
+- qwen2
+---
 # QwQ-32B-Preview-bnb-4bit
 
 ## Introduction
@@ -9,4 +20,4 @@ QwQ-32B-Preview-bnb-4bit is a 4-bit quantized version of the [QwQ-32B-Preview](h
 - **Quantization:** 4-bit using Bits and Bytes (bnb)
 - **Base Model:** [Qwen/QwQ-32B-Preview](https://huggingface.co/Qwen/QwQ-32B-Preview)
 - **Parameters:** 32.5 billion
-- **Context Length:** Up to 32,768 tokens
+- **Context Length:** Up to 32,768 tokens
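The README lists 32.5 billion parameters stored with 4-bit quantization. As a rough illustration of what that implies for weight storage, the sketch below does the arithmetic. It is a back-of-the-envelope estimate only: the function name is hypothetical, the 16-bit baseline is an assumption (the README does not state the original precision), and the figure ignores quantization metadata, any layers kept in higher precision, activations, and the KV cache.

```python
def weight_memory_bytes(num_params: float, bits_per_param: float) -> float:
    """Approximate bytes needed to store the weights alone (hypothetical helper)."""
    return num_params * bits_per_param / 8


PARAMS = 32.5e9  # parameter count stated in the README

four_bit = weight_memory_bytes(PARAMS, 4)
sixteen_bit = weight_memory_bytes(PARAMS, 16)  # assumed 16-bit baseline, not from the README

print(f"4-bit weights:  {four_bit / 1e9:.2f} GB")
print(f"16-bit weights: {sixteen_bit / 1e9:.2f} GB")
```

Actual memory use when loading the model will likely be higher than the 4-bit figure, since the estimate covers raw weight storage only.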