-
-
-
-
-
-
Inference status
Active filters:
vllm
aashish1904/Ministral-8B-Instruct-2410-HF-Q4_K_M-GGUF
Updated
•
6
•
1
QuantFactory/TouchNight-Ministral-8B-Instruct-2410-HF-GGUF
Updated
•
64
•
2
aashish1904/Ministral-8B-Instruct-2410-HF-Q2_K-GGUF
Updated
•
6
•
2
GrimsenClory/Ministral-8B-Instruct-2410-Q6_K-GGUF
QuantFactory/Ministral-8B-Instruct-2410-GGUF
Updated
•
506
•
2
gphorvath/Ministral-8B-Instruct-2410-Q4_K_M-GGUF
Gleisson1/Ministral-8B-Instruct-2410-HF-4bit
Updated
•
83
paultimothymooney/Ministral-8B-Instruct-2410-Q8_0-GGUF
paultimothymooney/Ministral-8B-Instruct-2410-Q4_K_M-GGUF
LouiSeHU/Mistral-Small-Instruct-2409-Q4_0-GGUF
yejingfu/nmagic-Meta-Llama-3.1-8B-Instruct-FP8
Text Generation
•
Updated
•
59.3k
mysticbeing/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-DYNAMIC
Text Generation
•
Updated
•
79
•
3
Ritvik19/Ministral-8B-Instruct-2410-Q4_K_M-GGUF
Gustav0-Freind/missmall
yejingfu/nmagic-Meta-Llama-3.1-70B-Instruct-FP8
Text Generation
•
Updated
•
5
neuralmagic/Sparse-Llama-3.1-8B-gsm8k-2of4
Text Generation
•
Updated
•
54
•
1
SicariusSicariiStuff/DeepSeek-Coder-V2-Instruct-FP8
neuralmagic/Sparse-Llama-3.1-8B-gsm8k-2of4-FP8-dynamic
Text Generation
•
Updated
•
218
•
1
yejingfu/nmagic-Meta-Llama-3-70B-Instruct-FP8
Updated
•
25
neuralmagic/Sparse-Llama-3.1-8B-evolcodealpaca-2of4-FP8-dynamic
Text Generation
•
Updated
•
20
Model-SafeTensors/Meta-Llama-3-8B-Instruct-FP8
Updated
•
4.92k
mistral-community/Pixtral-Large-Instruct-2411
Image-Text-to-Text
•
Updated
•
75
•
6
mgoin/Pixtral-Large-Instruct-2411
Updated
MikeRoz/mistralai_Mistral-Large-Instruct-2411-4.0bpw-h6-exl2
Updated
•
13
•
2
neuralmagic/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-FP8-dynamic
Text Generation
•
Updated
•
47
•
1
ijohn07/Ministral-8B-Instruct-2410-Q8_0-GGUF
MikeRoz/mistralai_Mistral-Large-Instruct-2411-3.0bpw-h6-exl2
Updated
•
1
•
1
bartowski/Mistral-Large-Instruct-2411-exl2
Text Generation
•
Updated
•
66
•
3
MikeRoz/mistralai_Mistral-Large-Instruct-2411-2.5bpw-h6-exl2
Updated
•
6
•
2
MikeRoz/mistralai_Mistral-Large-Instruct-2411-5.0bpw-h6-exl2
Updated
•
2
•
1