Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
ThomasBaruzier
/
Llama-3.1-Minitron-4B-Width-Base-GGUF
like
3
GGUF
Inference Endpoints
arxiv:
2009.03300
arxiv:
2407.14679
License:
nvidia-open-model-license
Model card
Files
Files and versions
Community
Deploy
Use this model
e24a300
Llama-3.1-Minitron-4B-Width-Base-GGUF
1 contributor
History:
32 commits
ThomasBaruzier
Upload Llama-3.1-Minitron-4B-Width-Base-Q5_K_M.gguf
e24a300
verified
5 months ago
.gitattributes
Safe
3.18 kB
Upload Llama-3.1-Minitron-4B-Width-Base-Q5_K_M.gguf
5 months ago
Llama-3.1-Minitron-4B-Width-Base-IQ1_M.gguf
Safe
1.29 GB
LFS
Upload Llama-3.1-Minitron-4B-Width-Base-IQ1_M.gguf
5 months ago
Llama-3.1-Minitron-4B-Width-Base-IQ1_S.gguf
Safe
1.21 GB
LFS
Upload Llama-3.1-Minitron-4B-Width-Base-IQ1_S.gguf
5 months ago
Llama-3.1-Minitron-4B-Width-Base-IQ2_M.gguf
Safe
1.72 GB
LFS
Upload Llama-3.1-Minitron-4B-Width-Base-IQ2_M.gguf
5 months ago
Llama-3.1-Minitron-4B-Width-Base-IQ2_S.gguf
Safe
1.63 GB
LFS
Upload Llama-3.1-Minitron-4B-Width-Base-IQ2_S.gguf
5 months ago
Llama-3.1-Minitron-4B-Width-Base-IQ2_XS.gguf
Safe
1.52 GB
LFS
Upload Llama-3.1-Minitron-4B-Width-Base-IQ2_XS.gguf
5 months ago
Llama-3.1-Minitron-4B-Width-Base-IQ2_XXS.gguf
Safe
1.41 GB
LFS
Upload Llama-3.1-Minitron-4B-Width-Base-IQ2_XXS.gguf
5 months ago
Llama-3.1-Minitron-4B-Width-Base-IQ3_M.gguf
Safe
2.18 GB
LFS
Upload Llama-3.1-Minitron-4B-Width-Base-IQ3_M.gguf
5 months ago
Llama-3.1-Minitron-4B-Width-Base-IQ3_S.gguf
Safe
2.11 GB
LFS
Upload Llama-3.1-Minitron-4B-Width-Base-IQ3_S.gguf
5 months ago
Llama-3.1-Minitron-4B-Width-Base-IQ3_XS.gguf
Safe
2.03 GB
LFS
Upload Llama-3.1-Minitron-4B-Width-Base-IQ3_XS.gguf
5 months ago
Llama-3.1-Minitron-4B-Width-Base-IQ3_XXS.gguf
Safe
1.88 GB
LFS
Upload Llama-3.1-Minitron-4B-Width-Base-IQ3_XXS.gguf
5 months ago
Llama-3.1-Minitron-4B-Width-Base-IQ4_NL.gguf
Safe
2.66 GB
LFS
Upload Llama-3.1-Minitron-4B-Width-Base-IQ4_NL.gguf
5 months ago
Llama-3.1-Minitron-4B-Width-Base-IQ4_XS.gguf
Safe
2.54 GB
LFS
Upload Llama-3.1-Minitron-4B-Width-Base-IQ4_XS.gguf
5 months ago
Llama-3.1-Minitron-4B-Width-Base-Q2_K.gguf
Safe
1.84 GB
LFS
Upload Llama-3.1-Minitron-4B-Width-Base-Q2_K.gguf
5 months ago
Llama-3.1-Minitron-4B-Width-Base-Q2_K_S.gguf
Safe
1.73 GB
LFS
Upload Llama-3.1-Minitron-4B-Width-Base-Q2_K_S.gguf
5 months ago
Llama-3.1-Minitron-4B-Width-Base-Q3_K_L.gguf
Safe
2.46 GB
LFS
Upload Llama-3.1-Minitron-4B-Width-Base-Q3_K_L.gguf
5 months ago
Llama-3.1-Minitron-4B-Width-Base-Q3_K_M.gguf
Safe
2.3 GB
LFS
Upload Llama-3.1-Minitron-4B-Width-Base-Q3_K_M.gguf
5 months ago
Llama-3.1-Minitron-4B-Width-Base-Q3_K_S.gguf
Safe
2.1 GB
LFS
Upload Llama-3.1-Minitron-4B-Width-Base-Q3_K_S.gguf
5 months ago
Llama-3.1-Minitron-4B-Width-Base-Q4_K_M.gguf
Safe
2.78 GB
LFS
Upload Llama-3.1-Minitron-4B-Width-Base-Q4_K_M.gguf
5 months ago
Llama-3.1-Minitron-4B-Width-Base-Q4_K_S.gguf
Safe
2.66 GB
LFS
Upload Llama-3.1-Minitron-4B-Width-Base-Q4_K_S.gguf
5 months ago
Llama-3.1-Minitron-4B-Width-Base-Q5_K_M.gguf
Safe
3.23 GB
LFS
Upload Llama-3.1-Minitron-4B-Width-Base-Q5_K_M.gguf
5 months ago
README.md
Safe
6.69 kB
Update README.md
5 months ago
imatrix.dat
Safe
3.68 MB
LFS
Upload imatrix.dat
5 months ago