--- base_model: Khetterman/Llama-3.2-Kapusta-3B-v8 pipeline_tag: text-generation library_name: transformers quantized_by: Khetterman tags: - mergekit - merge - llama - llama-3 - llama-3.2 - 3b - chat - creative - conversational - not-for-all-audiences language: - en - ru --- # Llama-3.2-Kapusta-3B-v8 GGUF Quantizations 🗲 >Small and useful. ![KapustaLogo256.png](https://cdn-uploads.huggingface.co/production/uploads/673125091920e70ac26c8a2e/8r4RQ-i03m84LeEhq6jxf.png) This model was converted to GGUF format using [llama.cpp](https://github.com/ggerganov/llama.cpp). For more information of the model, see the original model card: [Khetterman/Llama-3.2-Kapusta-3B-v8](https://huggingface.co/Khetterman/Llama-3.2-Kapusta-3B-v8). ## Available Quantizations (◕‿◕) | Type | Quantized GGUF Model | Size | |--------|----------------------|------| | Q4_0 | [Khetterman/Llama-3.2-Kapusta-3B-v8-Q4_0.gguf](https://huggingface.co/Khetterman/Llama-3.2-Kapusta-3B-v8-GGUF/blob/main/Llama-3.2-Kapusta-3B-v8-Q4_0.gguf) | 1.99 GiB | | Q6_K | [Khetterman/Llama-3.2-Kapusta-3B-v8-Q6_K.gguf](https://huggingface.co/Khetterman/Llama-3.2-Kapusta-3B-v8-GGUF/blob/main/Llama-3.2-Kapusta-3B-v8-Q6_K.gguf) | 2.76 GiB | | Q8_0 | [Khetterman/Llama-3.2-Kapusta-3B-v8-Q8_0.gguf](https://huggingface.co/Khetterman/Llama-3.2-Kapusta-3B-v8-GGUF/blob/main/Llama-3.2-Kapusta-3B-v8-Q8_0.gguf) | 3.57 GiB | >My thanks to the authors of the original models, your work is incredible. Have a good time 🖤