metadata

base_model: Qwen/Qwen2.5-14B-Instruct
tags:
  - fluently-lm
  - fluently-sets
  - demo
  - reasoning
  - text-generation-inference
  - transformers
  - unsloth
  - qwen2
  - trl
  - sft
license: apache-2.0
language:
  - en
datasets:
  - fluently-sets/reasoning-1-1k
pipeline_tag: text-generation

Reasoning-1 1K Demo (Finetune of Qwen2.5-14B-IT on Reasoning-1-1k dataset)

Q4_K_M GGUF-quant available here

This is SFT-finetune Qwen2.5-14B-IT on Reasoning-1-1K dataset. This is far from a perfect model, its main purpose is to show an example of using the dataset.

Base model: Qwen/Qwen2.5-14B-Instruct
Model type: Qwen2ForCausalLM
Number of parameters: 14.8B
Precision: FP16
Training method: SFT
Training dataset: fluently-sets/reasoning-1-1k
Languages: English (mostly)

Trained by Fluently Team (@ehristoforu) with Unsloth AI with love🥰