--- library_name: transformers pipeline_tag: text-generation tags: - 4-bit - Q4_K_S - cot - gguf - llama - llama-cpp - re1 - reasoning - text-generation --- # roleplaiapp/Reasoning-Llama-3.1-CoT-RE1-i1-Q4_K_S-GGUF **Repo:** `roleplaiapp/Reasoning-Llama-3.1-CoT-RE1-i1-Q4_K_S-GGUF` **Original Model:** `Reasoning-Llama-3.1-CoT-RE1-i1` **Quantized File:** `Reasoning-Llama-3.1-CoT-RE1.i1-Q4_K_S.gguf` **Quantization:** `GGUF` **Quantization Method:** `Q4_K_S` ## Overview This is a GGUF Q4_K_S quantized version of Reasoning-Llama-3.1-CoT-RE1-i1 ## Quantization By I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful. Andrew Webby @ [RolePlai](https://roleplai.app/).