QuantFactory/Llama-3.1-Minitron-4B-Width-Base-GGUF
Tags: GGUF, Inference Endpoints
arXiv: 2408.11796, 2009.03300, 2407.14679
License: nvidia-open-model-license
1 contributor
History: 6 commits
Latest commit: 1c1eda5 (verified) by aashish1904, 5 months ago: Upload Llama-3.1-Minitron-4B-Width-Base.Q8_0.gguf with huggingface_hub
File                                          Size     LFS  Updated       Last commit
.gitattributes                                1.84 kB  -    5 months ago  Upload Llama-3.1-Minitron-4B-Width-Base.Q8_0.gguf with huggingface_hub
Llama-3.1-Minitron-4B-Width-Base.Q4_0.gguf    2.65 GB  LFS  5 months ago  Upload Llama-3.1-Minitron-4B-Width-Base.Q4_0.gguf with huggingface_hub
Llama-3.1-Minitron-4B-Width-Base.Q4_1.gguf    2.91 GB  LFS  5 months ago  Upload Llama-3.1-Minitron-4B-Width-Base.Q4_1.gguf with huggingface_hub
Llama-3.1-Minitron-4B-Width-Base.Q4_K_M.gguf  2.78 GB  LFS  5 months ago  Upload Llama-3.1-Minitron-4B-Width-Base.Q4_K_M.gguf with huggingface_hub
Llama-3.1-Minitron-4B-Width-Base.Q8_0.gguf    4.8 GB   LFS  5 months ago  Upload Llama-3.1-Minitron-4B-Width-Base.Q8_0.gguf with huggingface_hub
README.md                                     6.19 kB  -    5 months ago  Upload README.md with huggingface_hub
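
The listing gives four GGUF quantizations of different sizes, so a reader will usually want to pick one and fetch only that file. The sketch below is one possible way to do that, not something the repository itself provides. The helper names `pick_largest_fitting` and `download` are hypothetical, the sizes are copied from the listing, and the budget is compared against file size only (actual memory use at inference time will be higher). The download step assumes the `huggingface_hub` package is installed and uses its `hf_hub_download` function.

```python
from typing import Optional

REPO_ID = "QuantFactory/Llama-3.1-Minitron-4B-Width-Base-GGUF"

# File sizes in GB, copied from the repository file listing.
GGUF_SIZES_GB = {
    "Llama-3.1-Minitron-4B-Width-Base.Q4_0.gguf": 2.65,
    "Llama-3.1-Minitron-4B-Width-Base.Q4_1.gguf": 2.91,
    "Llama-3.1-Minitron-4B-Width-Base.Q4_K_M.gguf": 2.78,
    "Llama-3.1-Minitron-4B-Width-Base.Q8_0.gguf": 4.8,
}


def pick_largest_fitting(budget_gb: float) -> Optional[str]:
    """Return the largest listed GGUF file whose size is within budget_gb.

    Hypothetical helper: it uses file size as a rough proxy and returns
    None when no listed file fits.
    """
    fitting = {name: size for name, size in GGUF_SIZES_GB.items()
               if size <= budget_gb}
    if not fitting:
        return None
    return max(fitting, key=fitting.get)


def download(filename: str) -> str:
    """Download one file from the repository and return its local path.

    Assumes huggingface_hub is installed; imported lazily so the size
    helper above works without it.
    """
    from huggingface_hub import hf_hub_download
    return hf_hub_download(repo_id=REPO_ID, filename=filename)


if __name__ == "__main__":
    choice = pick_largest_fitting(3.0)
    print("Selected:", choice)
    # Uncomment to fetch the selected file (requires network access):
    # if choice is not None:
    #     print(download(choice))
```

Treating file size as the only criterion is a simplification; quality and speed trade-offs between quantization types are not captured here.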