Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference status
Reset Inference status
Warm
Cold
Frozen
Misc
Reset Misc
quark
8-bit precision
Misc with no match
Inference Endpoints
AutoTrain Compatible
text-generation-inference
Eval Results
Merge
4-bit precision
custom_code
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
18
Full-text search
Edit filters
Sort: Trending
Active filters:
quark
Clear all
fxmarty/llama-tiny-testing-quark-indev
Updated
Oct 3, 2024
•
4
fxmarty/llama-tiny-int4-per-group-sym
Updated
Oct 25, 2024
•
32
fxmarty/llama-tiny-w-fp8-a-fp8
Updated
Oct 22, 2024
•
29
fxmarty/llama-tiny-w-fp8-a-fp8-o-fp8
Updated
Oct 22, 2024
•
21
fxmarty/llama-tiny-w-int8-per-tensor
Updated
Oct 22, 2024
•
75
fxmarty/llama-small-int4-per-group-sym-awq
Updated
Oct 29, 2024
•
35
fxmarty/quark-legacy-int8
Updated
Oct 10, 2024
•
246
fxmarty/llama-tiny-w-int8-b-int8-per-tensor
Updated
Oct 22, 2024
•
37
fxmarty/llama-small-int4-per-group-sym-awq-old
Updated
Oct 25, 2024
•
1
amd-quark/llama-tiny-w-int8-per-tensor
Updated
20 days ago
•
129
amd-quark/llama-tiny-w-int8-b-int8-per-tensor
Updated
20 days ago
•
120
amd-quark/llama-tiny-w-fp8-a-fp8
Updated
20 days ago
•
117
amd-quark/llama-tiny-w-fp8-a-fp8-o-fp8
Updated
20 days ago
•
117
amd-quark/llama-tiny-int4-per-group-sym
Updated
20 days ago
•
121
amd-quark/llama-small-int4-per-group-sym-awq
Updated
20 days ago
•
116
amd-quark/quark-legacy-int8
Updated
20 days ago
•
120
amd/Llama-3.1-8B-Instruct-FP8-KV-Quark-test
Updated
about 5 hours ago
amd/Llama-3.1-8B-Instruct-w-int8-a-int8-sym-test
Updated
about 5 hours ago