Schaapje 2B Chat V1.0
This is a collection of GGUF files created from Schaapje-2B-Chat-V1.0.
It contains the files in the following quantization formats: Q5_0, Q5_K_M, Q6_K, Q8_0.
Before you can use the GGUF files, you need to clone the llama.cpp repository and install it by following the official guide.
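As a rough sketch of what running one of these files might look like once llama.cpp is built, the snippet below assembles a command line for the `llama-cli` binary (named `main` in older llama.cpp builds). The binary location and the GGUF filename are placeholders, not names taken from this collection. `-m`, `-p`, `-n` and `--temp` are the llama.cpp options for model path, prompt, number of tokens to generate and sampling temperature.

```python
import shlex
from pathlib import Path


def build_llama_command(model_path, prompt, n_predict=128, temperature=0.7,
                        binary="./llama-cli"):
    """Return the argument list for a single llama.cpp generation run.

    `binary` and `model_path` are placeholders: point them at your own
    llama.cpp build and at the GGUF file you downloaded.
    """
    return [
        binary,
        "-m", str(Path(model_path)),
        "-p", prompt,
        "-n", str(n_predict),
        "--temp", str(temperature),
    ]


if __name__ == "__main__":
    # Hypothetical filename; use the actual name of the downloaded file.
    cmd = build_llama_command("schaapje-2b-chat-q5_0.gguf", "Hallo, wie ben jij?")
    # Print a copy-pasteable shell command instead of executing it.
    print(shlex.join(cmd))
```

The function only builds the command. To actually run it, the returned list can be passed to `subprocess.run`.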
The llama.cpp parameters can have a big impact on the quality of the generated text, so it is recommended to experiment with different settings yourself. In my own experiments, quantization 'Q5_0' or better appears to give good quality.
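One way to organise such experiments is to enumerate a small grid of settings and run the same prompt under each combination. The sketch below only builds the grid. The filename pattern and the temperature and top-p values are illustrative assumptions, not recommended settings. `--temp` and `--top-p` are llama.cpp sampling options.

```python
from itertools import product

# Quantization formats shipped in this collection.
QUANTIZATIONS = ["Q5_0", "Q5_K_M", "Q6_K", "Q8_0"]


def sweep_settings(quantizations, temperatures, top_ps):
    """Yield one settings dict per combination of the given values."""
    for quant, temp, top_p in product(quantizations, temperatures, top_ps):
        yield {
            # Hypothetical filename pattern; adjust to the real file names.
            "model": f"schaapje-2b-chat-{quant.lower()}.gguf",
            "args": ["--temp", str(temp), "--top-p", str(top_p)],
        }


if __name__ == "__main__":
    # Example values only; pick ranges that suit your own prompts.
    for setting in sweep_settings(QUANTIZATIONS, [0.2, 0.7], [0.9, 0.95]):
        print(setting["model"], " ".join(setting["args"]))
```

Comparing the outputs for a fixed prompt across these combinations gives a simple, repeatable basis for judging which quantization and sampling settings work best for a given use case.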