Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
ThomasBaruzier
/
Llama-3.1-Minitron-4B-Width-Base-GGUF
like
3
GGUF
Inference Endpoints
arxiv:
2009.03300
arxiv:
2407.14679
License:
nvidia-open-model-license
Model card
Files
Files and versions
Community
Deploy
Use this model
72d3578
Llama-3.1-Minitron-4B-Width-Base-GGUF
1 contributor
History:
9 commits
ThomasBaruzier
Upload imatrix.dat
72d3578
verified
5 months ago
.gitattributes
Safe
1.73 kB
Upload imatrix.dat
5 months ago
Llama-3.1-Minitron-4B-Width-Base-IQ1_M.gguf
Safe
1.29 GB
LFS
Upload Llama-3.1-Minitron-4B-Width-Base-IQ1_M.gguf
5 months ago
Llama-3.1-Minitron-4B-Width-Base-IQ1_S.gguf
Safe
1.21 GB
LFS
Upload Llama-3.1-Minitron-4B-Width-Base-IQ1_S.gguf
5 months ago
README.md
Safe
6.76 kB
Update README.md
5 months ago
imatrix.dat
Safe
3.68 MB
LFS
Upload imatrix.dat
5 months ago