--- datasets: - PowerInfer/QWQ-LONGCOT-500K - PowerInfer/LONGCOT-Refine-500K base_model: - Qwen/Qwen2.5-3B-Instruct pipeline_tag: text-generation language: - en library_name: transformers tags: - llama-cpp - SmallThinker-3B - gguf - Q8_0 - 3b - SmallThinker - qwen - llama-cpp - PowerInfer - code - math - chat - roleplay - text-generation - safetensors - nlp - code --- # roleplaiapp/SmallThinker-3B-Preview-Q8_0-GGUF **Repo:** `roleplaiapp/SmallThinker-3B-Preview-Q8_0-GGUF` **Original Model:** `SmallThinker-3B` **Organization:** `PowerInfer` **Quantized File:** `smallthinker-3b-preview-q8_0.gguf` **Quantization:** `GGUF` **Quantization Method:** `Q8_0` **Use Imatrix:** `False` **Split Model:** `False` ## Overview This is an GGUF Q8_0 quantized version of [SmallThinker-3B](https://huggingface.co/PowerInfer/SmallThinker-3B-Preview). ## Quantization By I often have idle A100 GPUs while building/testing and training the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful. Andrew Webby @ [RolePlai](https://roleplai.app/)