Active filters: 8-bit
anokimchen/sd-turbo-openvino-8bit-GPT4vision-calibrated • Text-to-Image • 1
shuyuej/Mistral-Nemo-Instruct-2407-GPTQ-INT8 • 54 • 1
alpindale/Meta-Llama-3.1-70B-Instruct-GPTQ-INT8 • 80 • 2
FuturisticVibes/Rocinante-12B-v1.1-8.0bpw-h8-exl2 • 20 • 1
Statuo/Celeste-v1.9-8bpw-EXL2 • Text Generation • 23 • 1
MaziyarPanahi/SmolLM-1.7B-Instruct-v0.2-GGUF • Text Generation • 579 • 7
MaziyarPanahi/Phi-3.5-mini-instruct-GGUF • Text Generation • 2.33M • 6
neuralmagic/SmolLM-1.7B-Instruct-quantized.w8a8 • Text Generation • 22 • 1
KhanhVan/Vistral-7B-Chat-gguf1 • Text Generation • 27 • 2
Qwen/Qwen2-VL-7B-Instruct-GPTQ-Int8 • Image-Text-to-Text • 7.62k • 20
Qwen/Qwen2-VL-2B-Instruct-GPTQ-Int8 • Image-Text-to-Text • 3.28k • 11
FuturisticVibes/ArliAI-RPMax-12B-v1.1-8.0bpw-h8-exl2 • 8 • 2
jadechoghari/aya-23-8B-quantized • Text Generation • 65 • 3
MaziyarPanahi/Yi-Coder-9B-Chat-GGUF • Text Generation • 2.32M • 2
MaziyarPanahi/DeepSeek-V2.5-GGUF • Text Generation • 43.7k • 4
HF1BitLLM/Llama3-8B-1.58-100B-tokens • Text Generation • 3k • 166
MaziyarPanahi/solar-pro-preview-instruct-GGUF • Text Generation • 2.32M • 22
Qwen/Qwen2-VL-72B-Instruct-GPTQ-Int8 • Image-Text-to-Text • 5.01k • 7
Qwen/Qwen2.5-0.5B-Instruct-GPTQ-Int8 • Text Generation • 1.28k • 7
Qwen/Qwen2.5-1.5B-Instruct-GPTQ-Int8 • Text Generation • 2.19k • 2
Qwen/Qwen2.5-7B-Instruct-GPTQ-Int8 • Text Generation • 9.78k • 10
Qwen/Qwen2.5-14B-Instruct-GPTQ-Int8 • Text Generation • 5.84k • 12
Qwen/Qwen2.5-32B-Instruct-GPTQ-Int8 • Text Generation • 4.05k • 8
Qwen/Qwen2.5-72B-Instruct-GPTQ-Int8 • Text Generation • 7.79k • 14
LoneStriker/Mistral-Small-Instruct-2409-8.0bpw-h8-exl2 • 25 • 5
DewEfresh/pixtral-12b-8bit • Image-Text-to-Text • 84 • 12
MaziyarPanahi/Qwen2.5-1.5B-Instruct-GGUF • Text Generation • 2.32M • 2
MaziyarPanahi/Qwen2.5-7B-Instruct-GGUF • Text Generation • 2.33M • 8
brunopio/Llama3-8B-1.58-100B-tokens-GGUF • Text Generation • 961k • 12
Qwen/Qwen2.5-Coder-7B-Instruct-GPTQ-Int8 • Text Generation • 810 • 3