nmerkle/Meta-Llama-3-8B-Instruct-ggml-model-Q4_K_M.gguf
Tags: GGUF, Inference Endpoints, conversational
License: llama3
Quantized Meta-Llama-3-8B-Instruct model
Tested inference on a Raspberry Pi 4 with llama.cpp: roughly 0.5 tokens per second.
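To put the measured rate in perspective, here is a minimal sketch that converts a token budget into wall-clock time. Only the ~0.5 tokens per second figure comes from the test above; the function name `generation_seconds` and the 256-token reply length are illustrative assumptions, and the estimate ignores prompt processing and model load time.

```python
def generation_seconds(num_tokens: int, tokens_per_second: float = 0.5) -> float:
    """Estimate wall-clock seconds to generate num_tokens at a fixed rate."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return num_tokens / tokens_per_second


# Hypothetical 256-token reply at the measured ~0.5 tokens/s.
seconds = generation_seconds(256)
print(f"{seconds:.0f} s (~{seconds / 60:.1f} min)")
```

At this rate even a short reply takes minutes, so the setup is likely better suited to batch or offline use than to interactive chat.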
GGUF
Model size: 8.03B params
Architecture: llama
Quantization: 4-bit (Q4_K_M)
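As a rough sanity check on memory needs, the parameter count and nominal bit width listed above give a lower bound on the weight size. This is a back-of-the-envelope sketch, not a statement of the actual file size: Q4_K_M stores some tensors at higher precision and adds per-block scales, so the real GGUF file is larger than the 4-bit figure. The helper name `nominal_weight_bytes` is hypothetical.

```python
def nominal_weight_bytes(num_params: float, bits_per_weight: float) -> float:
    """Bytes needed to store num_params weights at a uniform bit width."""
    return num_params * bits_per_weight / 8


PARAMS = 8.03e9  # parameter count from the model card

# Lower bound assuming exactly 4 bits per weight (an idealisation of Q4_K_M).
four_bit_gb = nominal_weight_bytes(PARAMS, 4) / 1e9
# Same weights at 16 bits per weight, for comparison.
sixteen_bit_gb = nominal_weight_bytes(PARAMS, 16) / 1e9

print(f"4-bit lower bound: {four_bit_gb:.1f} GB")
print(f"16-bit reference:  {sixteen_bit_gb:.1f} GB")
```

The comparison suggests why a 4-bit quantization is what makes an 8B-parameter model plausible on a Raspberry Pi 4 at all, provided the board has enough RAM for the weights plus the context cache.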