Active filters: fp8
amd/Llama-3.2-3B-Instruct-FP8-KV • Updated • 11
amd/Llama-3.2-3B-FP8-KV • Updated • 15
amd/Llama-3.2-1B-Instruct-FP8-KV • Updated • 19
amd/Llama-3.2-1B-FP8-KV • Updated • 22
SicariusSicariiStuff/Dusk_Rainbow_FP8
amd/Llama-3.2-90B-Vision-Instruct-FP8-KV • Updated • 225
soprasteria/Mixtral-8x7B-Instruct-v0.1-FP8 • Updated • 16
neuralmagic/Phi-3.5-mini-instruct-FP8-KV • Text Generation • Updated • 148 • 2
CalamitousFelicitousness/SorcererLM-8x22b-FP8-Dynamic
John6666/stoiqo-afrodite-fluxxl-f1dalpha-fp8-flux • Text-to-Image • Updated • 62.5k • 3
obamaTeo/llama-finetune-8bit-wiki-284-ver2
fxmarty/quark-legacy-fp8 • Updated • 44
amd/jais-13b-chat-FP8 • Updated • 10
neuralmagic/pixtral-12b-FP8-dynamic • Text Generation • Updated • 2.85k • 7
predibase/Qwen2.5-14B-FP8
CalamitousFelicitousness/banana-2-b-72b-FP8-Dynamic
taozi555/Llama-Guard-3-8B-FP8
ajinkya-tejankar/Mistral-7B-Instruct-v0.2-FP8-UltraChat-2000-KV
Infermatic/Lumimaid-v0.2-70B-FP8-Dynamic • Updated • 30
predibase/Qwen2.5-32B-Instruct-FP8 • Updated • 307
Infermatic/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-Dynamic • Text Generation • Updated • 146
predibase/Mistral-7B-Instruct-v0.2-FP8-UltraChat-2000-KV
neuralmagic/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic • Text Generation • Updated • 67.2k • 14
Infermatic/magnum-v4-72b-FP8-Dynamic • Text Generation • Updated • 908 • 1
amd/dbrx-base-FP8-KV • Updated • 18
Infermatic/Stellar-Odyssey-12b-v0.0-FP8-Dynamic
Infermatic/Chronos-Platinum-72B-FP8-Dynamic
Infermatic/Nautilus-70B-v0.1-FP8-Dynamic
yejingfu/nmagic-Meta-Llama-3.1-8B-Instruct-FP8 • Text Generation • Updated • 49.7k
mysticbeing/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-DYNAMIC • Text Generation • Updated • 89 • 3