Parm V1
Collection
First gen of the Pinkstack advanced reasoning models. Medium quality but better than the original models they are based on.
β’
3 items
β’
Updated
β’
2
This PARM is based on Qwen 2.5 3B which has gotten extra reasoning training parameters so it would have similar outputs to qwen QwQ / O.1 mini (only much, smaller.), We trained using this dataset. it is designed to run on any device, from your phone to high-end PC.
To use this model, you must use a service which supports the GGUF file format. Additionaly, this is the Prompt Template, it uses the qwen2 template.
{{ if .System }}<|system|>
{{ .System }}<|end|>
{{ end }}{{ if .Prompt }}<|user|>
{{ .Prompt }}<|end|>
{{ end }}<|assistant|>
{{ .Response }}<|end|>
Or if you are using an anti prompt: <|end|><|assistant|>
Highly recommended to use with a system prompt.
This model was trained using Unsloth and Huggingface's TRL library.
Used this model? Don't forget to leave a like :)