-
-
-
-
-
-
Inference status
Active filters:
nm-vllm
neuralmagic/TinyLlama-1.1B-Chat-v1.0-pruned2.4
Text Generation
•
Updated
•
28
•
1
neuralmagic/MiniChat-2-3B-pruned2.4
Text Generation
•
Updated
•
17
neuralmagic/OpenHermes-2.5-Mistral-7B-pruned2.4
Text Generation
•
Updated
•
166
neuralmagic/OpenHermes-2.5-Mistral-7B-pruned50
Text Generation
•
Updated
•
132
•
1
neuralmagic/Nous-Hermes-2-SOLAR-10.7B-pruned2.4
Text Generation
•
Updated
•
22
neuralmagic/Nous-Hermes-2-Yi-34B-pruned2.4
Text Generation
•
Updated
•
16
neuralmagic/Nous-Hermes-2-Yi-34B-pruned50
Text Generation
•
Updated
•
15
neuralmagic/zephyr-7b-beta-marlin
Text Generation
•
Updated
•
528
neuralmagic/llama2.c-stories110M-pruned2.4
Text Generation
•
Updated
•
14
neuralmagic/llama2.c-stories110M-pruned50
Text Generation
•
Updated
•
787
neuralmagic/phi-2-pruned50
Text Generation
•
Updated
•
37
neuralmagic/TinyLlama-1.1B-Chat-v1.0-marlin
Text Generation
•
Updated
•
2.73k
•
1
neuralmagic/OpenHermes-2.5-Mistral-7B-marlin
Text Generation
•
Updated
•
728
•
2
neuralmagic/Nous-Hermes-2-Yi-34B-marlin
Text Generation
•
Updated
•
13
•
5
softmax/Llama-2-70b-chat-hf-marlin
Text Generation
•
Updated
•
118
softmax/falcon-180B-chat-marlin
Text Generation
•
Updated
•
17
dtransposed/llama2.c-stories110M-pruned50-compressed-tensors
Text Generation
•
Updated
•
5
nm-testing/llama2.c-stories110M-pruned50-compressed-tensors
Text Generation
•
Updated
•
6
mradermacher/Nous-Hermes-2-SOLAR-10.7B-pruned2.4-GGUF
mradermacher/Nous-Hermes-2-SOLAR-10.7B-pruned2.4-i1-GGUF
Updated
•
266
tensorblock/llama2.c-stories110M-pruned50-GGUF
Updated
•
80
mradermacher/phi-2-pruned50-GGUF
Updated
•
19
mradermacher/llama2.c-stories110M-pruned50-GGUF
Updated
•
30
mradermacher/OpenHermes-2.5-Mistral-7B-pruned50-GGUF
Updated
•
27
mradermacher/MiniChat-2-3B-pruned2.4-GGUF
Updated
•
73
mradermacher/OpenHermes-2.5-Mistral-7B-pruned50-i1-GGUF
Updated
•
29
mradermacher/llama2.c-stories110M-pruned50-i1-GGUF
Updated
•
68
mradermacher/OpenHermes-2.5-Mistral-7B-pruned2.4-GGUF
Updated
•
21
mradermacher/OpenHermes-2.5-Mistral-7B-pruned2.4-i1-GGUF
Updated
•
36
tensorblock/OpenHermes-2.5-Mistral-7B-pruned2.4-GGUF
Updated
•
178