ThomasBaruzier/Llama-3.1-Minitron-4B-Width-Base-GGUF
Tags: GGUF, Inference Endpoints, arxiv:2009.03300, arxiv:2407.14679
License: nvidia-open-model-license
1 contributor
History: 16 commits
Latest commit: cbba92f (verified), ThomasBaruzier, "Upload Llama-3.1-Minitron-4B-Width-Base-IQ2_M.gguf", 5 months ago
File                                           Size     Storage  Scan  Updated       Last commit
.gitattributes                                 2.21 kB  -        Safe  5 months ago  Upload Llama-3.1-Minitron-4B-Width-Base-IQ2_M.gguf
Llama-3.1-Minitron-4B-Width-Base-IQ1_M.gguf    1.29 GB  LFS      Safe  5 months ago  Upload Llama-3.1-Minitron-4B-Width-Base-IQ1_M.gguf
Llama-3.1-Minitron-4B-Width-Base-IQ1_S.gguf    1.21 GB  LFS      Safe  5 months ago  Upload Llama-3.1-Minitron-4B-Width-Base-IQ1_S.gguf
Llama-3.1-Minitron-4B-Width-Base-IQ2_M.gguf    1.72 GB  LFS      Safe  5 months ago  Upload Llama-3.1-Minitron-4B-Width-Base-IQ2_M.gguf
Llama-3.1-Minitron-4B-Width-Base-IQ2_S.gguf    1.63 GB  LFS      Safe  5 months ago  Upload Llama-3.1-Minitron-4B-Width-Base-IQ2_S.gguf
Llama-3.1-Minitron-4B-Width-Base-IQ2_XS.gguf   1.52 GB  LFS      Safe  5 months ago  Upload Llama-3.1-Minitron-4B-Width-Base-IQ2_XS.gguf
Llama-3.1-Minitron-4B-Width-Base-IQ2_XXS.gguf  1.41 GB  LFS      Safe  5 months ago  Upload Llama-3.1-Minitron-4B-Width-Base-IQ2_XXS.gguf
Llama-3.1-Minitron-4B-Width-Base-Q2_K.gguf     1.84 GB  LFS      Safe  5 months ago  Upload Llama-3.1-Minitron-4B-Width-Base-Q2_K.gguf
Llama-3.1-Minitron-4B-Width-Base-Q2_K_S.gguf   1.73 GB  LFS      Safe  5 months ago  Upload Llama-3.1-Minitron-4B-Width-Base-Q2_K_S.gguf
README.md                                      6.69 kB  -        Safe  5 months ago  Update README.md
imatrix.dat                                    3.68 MB  LFS      Safe  5 months ago  Upload imatrix.dat