roleplaiapp
/

SmallThinker-3B-Preview-IQ4_XS-GGUF

Text Generation

Inference Endpoints

Model card Files Files and versions Community

roleplaiapp commited on 2 days ago

Commit

43dc38d

·

verified ·

1 Parent(s): 300ddfc

Upload README.md with huggingface_hub

Files changed (1) hide show

README.md +49 -0

README.md ADDED Viewed

	@@ -0,0 +1,49 @@

+---
+datasets:
+- PowerInfer/QWQ-LONGCOT-500K
+- PowerInfer/LONGCOT-Refine-500K
+base_model:
+- Qwen/Qwen2.5-3B-Instruct
+pipeline_tag: text-generation
+language:
+- en
+library_name: transformers
+tags:
+- llama-cpp
+- imatrix
+- gguf
+- Q8_0
+- 3b
+- SmallThinker
+- qwen
+- llama-cpp
+- PowerInfer
+- code
+- math
+- chat
+- roleplay
+- text-generation
+- safetensors
+- nlp
+- code
+---
+# roleplaiapp/SmallThinker-3B-Preview-IQ4_XS-GGUF
+**Repo:** `roleplaiapp/SmallThinker-3B-Preview-IQ4_XS-GGUF`
+**Original Model:** `imatrix`
+**Organization:** `PowerInfer`
+**Quantized File:** `smallthinker-3b-preview-iq4_xs-imat.gguf`
+**Quantization:** `GGUF`
+**Quantization Method:** `Q8_0`
+**Use Imatrix:** `True`
+**Split Model:** `False`
+## Overview
+This is an GGUF Q8_0 quantized version of [imatrix](https://huggingface.co/PowerInfer/SmallThinker-3B-Preview).
+## Quantization By
+I often have idle A100 GPUs while building/testing and training the RP app, so I put them to use quantizing models.
+I hope the community finds these quantizations useful.
+Andrew Webby @ [RolePlai](https://roleplai.app/)