Models: 18

Active filters: quark

fxmarty/llama-tiny-testing-quark-indev

Updated Oct 3, 2024 • 4

fxmarty/llama-tiny-int4-per-group-sym

Updated Oct 25, 2024 • 32

fxmarty/llama-tiny-w-fp8-a-fp8

Updated Oct 22, 2024 • 29

fxmarty/llama-tiny-w-fp8-a-fp8-o-fp8

Updated Oct 22, 2024 • 21

fxmarty/llama-tiny-w-int8-per-tensor

Updated Oct 22, 2024 • 75

fxmarty/llama-small-int4-per-group-sym-awq

Updated Oct 29, 2024 • 35

fxmarty/quark-legacy-int8

Updated Oct 10, 2024 • 246

fxmarty/llama-tiny-w-int8-b-int8-per-tensor

Updated Oct 22, 2024 • 37

fxmarty/llama-small-int4-per-group-sym-awq-old

Updated Oct 25, 2024 • 1

amd-quark/llama-tiny-w-int8-per-tensor

Updated 20 days ago • 129

amd-quark/llama-tiny-w-int8-b-int8-per-tensor

Updated 20 days ago • 120

amd-quark/llama-tiny-w-fp8-a-fp8

Updated 20 days ago • 117

amd-quark/llama-tiny-w-fp8-a-fp8-o-fp8

Updated 20 days ago • 117

amd-quark/llama-tiny-int4-per-group-sym

Updated 20 days ago • 121

amd-quark/llama-small-int4-per-group-sym-awq

Updated 20 days ago • 116

amd-quark/quark-legacy-int8

Updated 20 days ago • 120

amd/Llama-3.1-8B-Instruct-FP8-KV-Quark-test

Updated about 5 hours ago

amd/Llama-3.1-8B-Instruct-w-int8-a-int8-sym-test

Updated about 5 hours ago