Edit Models filters

Inference status

Misc

8-bit precision

Inference Endpoints

AutoTrain Compatible

text-generation-inference

4-bit precision

Mixture of Experts

text-embeddings-inference

Carbon Emissions

Models

9,578

Full-text search

Active filters: 8-bit

anokimchen/sd-turbo-openvino-8bit-GPT4vision-calibrated

Text-to-Image • Updated Aug 7, 2024 • 1

shuyuej/Mistral-Nemo-Instruct-2407-GPTQ-INT8

Updated Aug 7, 2024 • 54 • 1

alpindale/Meta-Llama-3.1-70B-Instruct-GPTQ-INT8

Updated Aug 13, 2024 • 80 • 2

FuturisticVibes/Rocinante-12B-v1.1-8.0bpw-h8-exl2

Updated Aug 23, 2024 • 20 • 1

Statuo/Celeste-v1.9-8bpw-EXL2

Text Generation • Updated Aug 17, 2024 • 23 • 1

MaziyarPanahi/SmolLM-1.7B-Instruct-v0.2-GGUF

Text Generation • Updated Aug 18, 2024 • 579 • 7

MaziyarPanahi/Phi-3.5-mini-instruct-GGUF

Text Generation • Updated Aug 20, 2024 • 2.33M • 6

neuralmagic/SmolLM-1.7B-Instruct-quantized.w8a8

Text Generation • Updated Oct 9, 2024 • 22 • 1

KhanhVan/Vistral-7B-Chat-gguf1

Text Generation • Updated Aug 24, 2024 • 27 • 2

Qwen/Qwen2-VL-7B-Instruct-GPTQ-Int8

Image-Text-to-Text • Updated Sep 21, 2024 • 7.62k • 20

Qwen/Qwen2-VL-2B-Instruct-GPTQ-Int8

Image-Text-to-Text • Updated Sep 21, 2024 • 3.28k • 11

FuturisticVibes/ArliAI-RPMax-12B-v1.1-8.0bpw-h8-exl2

Updated Sep 1, 2024 • 8 • 2

jadechoghari/aya-23-8B-quantized

Text Generation • Updated Sep 1, 2024 • 65 • 3

MaziyarPanahi/Yi-Coder-9B-Chat-GGUF

Text Generation • Updated Sep 4, 2024 • 2.32M • 2

MaziyarPanahi/DeepSeek-V2.5-GGUF

Text Generation • Updated Sep 11, 2024 • 43.7k • 4

HF1BitLLM/Llama3-8B-1.58-100B-tokens

Text Generation • Updated Sep 19, 2024 • 3k • 166

MaziyarPanahi/solar-pro-preview-instruct-GGUF

Text Generation • Updated Sep 13, 2024 • 2.32M • 22

Qwen/Qwen2-VL-72B-Instruct-GPTQ-Int8

Image-Text-to-Text • Updated Sep 24, 2024 • 5.01k • 7

Qwen/Qwen2.5-0.5B-Instruct-GPTQ-Int8

Text Generation • Updated Oct 9, 2024 • 1.28k • 7

Qwen/Qwen2.5-1.5B-Instruct-GPTQ-Int8

Text Generation • Updated Oct 9, 2024 • 2.19k • 2

Qwen/Qwen2.5-7B-Instruct-GPTQ-Int8

Text Generation • Updated Oct 18, 2024 • 9.78k • 10

Qwen/Qwen2.5-14B-Instruct-GPTQ-Int8

Text Generation • Updated Oct 9, 2024 • 5.84k • 12

Qwen/Qwen2.5-32B-Instruct-GPTQ-Int8

Text Generation • Updated Oct 9, 2024 • 4.05k • 8

Qwen/Qwen2.5-72B-Instruct-GPTQ-Int8

Text Generation • Updated Oct 9, 2024 • 7.79k • 14

LoneStriker/Mistral-Small-Instruct-2409-8.0bpw-h8-exl2

Updated Sep 17, 2024 • 25 • 5

DewEfresh/pixtral-12b-8bit

Image-Text-to-Text • Updated Sep 18, 2024 • 84 • 12

MaziyarPanahi/Qwen2.5-1.5B-Instruct-GGUF

Text Generation • Updated Sep 18, 2024 • 2.32M • 2

MaziyarPanahi/Qwen2.5-7B-Instruct-GGUF

Text Generation • Updated Sep 18, 2024 • 2.33M • 8

brunopio/Llama3-8B-1.58-100B-tokens-GGUF

Text Generation • Updated Sep 19, 2024 • 961k • 12

Qwen/Qwen2.5-Coder-7B-Instruct-GPTQ-Int8

Text Generation • Updated Nov 18, 2024 • 810 • 3