Dracones committed on
Commit 94a24bd
1 parent: e2fde3c

Upload folder using huggingface_hub

Athene-V2-Chat.json ADDED
The diff for this file is too large to render. See raw diff
 
Llama-3.1-Nemotron-70B-Instruct.json ADDED
The diff for this file is too large to render. See raw diff
 
QwQ-32B-Preview.json ADDED
The diff for this file is too large to render. See raw diff
 
Qwen2.5-32B-Instruct.json ADDED
The diff for this file is too large to render. See raw diff
 
Qwen2.5-72B-Instruct.json ADDED
The diff for this file is too large to render. See raw diff
 
Qwen2.5-Coder-32B-Instruct.json ADDED
The diff for this file is too large to render. See raw diff
 
README.md CHANGED
@@ -22,5 +22,12 @@ This repository contains EXL2 measurement files for quants made here.
  | `CodeQwen1.5-7B.json` | EXL2 measurement file for CodeQwen1.5-7B | [Qwen/CodeQwen1.5-7B](https://huggingface.co/Qwen/CodeQwen1.5-7B) |
  | `Mixtral-8x22B-Instruct-v0.1.json` | EXL2 measurement file for Mixtral-8x22B-Instruct-v0.1 | [mistralai/Mixtral-8x22B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x22B-Instruct-v0.1) |
  | `Llama-3-Lumimaid-70B-v0.1.json` | EXL2 measurement file for Llama-3-Lumimaid-70B-v0.1 | [NeverSleep/Llama-3-Lumimaid-70B-v0.1](https://huggingface.co/NeverSleep/Llama-3-Lumimaid-70B-v0.1) |
+ | `Qwen2.5-72B-Instruct.json` | EXL2 measurement file for Qwen2.5-72B-Instruct | [Qwen/Qwen2.5-72B-Instruct](https://huggingface.co/Qwen/Qwen2.5-72B-Instruct) |
+ | `Qwen2.5-Coder-32B-Instruct.json` | EXL2 measurement file for Qwen2.5-Coder-32B-Instruct | [Qwen/Qwen2.5-Coder-32B-Instruct](https://huggingface.co/Qwen/Qwen2.5-Coder-32B-Instruct) |
+ | `QwQ-32B-Preview.json` | EXL2 measurement file for QwQ-32B-Preview | [Qwen/QwQ-32B-Preview](https://huggingface.co/Qwen/QwQ-32B-Preview) |
+ | `Athene-V2-Chat.json` | EXL2 measurement file for Athene-V2-Chat | [Nexusflow/Athene-V2-Chat](https://huggingface.co/Nexusflow/Athene-V2-Chat) |
+ | `Llama-3.1-Nemotron-70B-Instruct.json` | EXL2 measurement file for Llama-3.1-Nemotron-70B-Instruct | [nvidia/Llama-3.1-Nemotron-70B-Instruct-HF](https://huggingface.co/nvidia/Llama-3.1-Nemotron-70B-Instruct-HF) |
+ | `Qwen2.5-32B-Instruct.json` | EXL2 measurement file for Qwen2.5-32B-Instruct | [Qwen/Qwen2.5-32B-Instruct](https://huggingface.co/Qwen/Qwen2.5-Coder-32B-Instruct) |
+
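
The README rows in this hunk all follow one pattern: file name, a fixed description, and a link to the source model. As a rough sketch only (the helper name `measurement_row` is hypothetical and not part of this repository, and it needs Python 3.9+ for `str.removesuffix`), a row could be generated like this:

```python
def measurement_row(filename: str, repo_id: str) -> str:
    """Build one README table row for an EXL2 measurement file.

    Hypothetical helper: it only mirrors the row pattern visible in the
    README diff and is not taken from the repository itself.
    """
    # The description uses the file name without its .json extension.
    model_name = filename.removesuffix(".json")
    # The link target is the model's page on the Hugging Face Hub.
    url = f"https://huggingface.co/{repo_id}"
    return (
        f"| `{filename}` | EXL2 measurement file for {model_name} "
        f"| [{repo_id}]({url}) |"
    )


if __name__ == "__main__":
    print(measurement_row("QwQ-32B-Preview.json", "Qwen/QwQ-32B-Preview"))
```

The repo id is passed separately because it does not always match the file name; the Nemotron row, for example, links to a repo whose name ends in `-HF`.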