metadata
base_model: Qwen/Qwen2.5-14B-Instruct
tags:
- fluently-lm
- fluently-sets
- demo
- reasoning
- text-generation-inference
- transformers
- unsloth
- qwen2
- trl
- sft
license: apache-2.0
language:
- en
datasets:
- fluently-sets/reasoning-1-1k
pipeline_tag: text-generation
Reasoning-1 1K Demo (Finetune of Qwen2.5-14B-IT on Reasoning-1-1k dataset)
Q4_K_M GGUF-quant available here
This is SFT-finetune Qwen2.5-14B-IT on Reasoning-1-1K dataset. This is far from a perfect model, its main purpose is to show an example of using the dataset.
- Base model: Qwen/Qwen2.5-14B-Instruct
- Model type: Qwen2ForCausalLM
- Number of parameters: 14.8B
- Precision: FP16
- Training method: SFT
- Training dataset: fluently-sets/reasoning-1-1k
- Languages: English (mostly)
Trained by Fluently Team (@ehristoforu) with Unsloth AI with love🥰