---
library_name: transformers
tags:
- trl
- sft
base_model:
- meta-llama/Llama-3.2-1B-Instruct
datasets:
- ngxson/MiniThinky-dataset
---
# MiniThinky 1B
My first attempt at fine-tuning a small model to add reasoning capability.

The chat template is the same as Llama 3, but the response will be structured as follows:
```
<|thinking|>{thinking_process}
<|answer|>
{real_answer}
```
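
As a rough usage sketch (the repo id and prompt below are placeholders, and whether `<|thinking|>` / `<|answer|>` are registered as special tokens is an assumption), the thinking trace and the final answer can be split on the `<|answer|>` marker:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "ngxson/MiniThinky-1B"  # placeholder: replace with this repo's actual id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Standard Llama 3 chat template
messages = [{"role": "user", "content": "How many r's are in the word 'strawberry'?"}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
)

output_ids = model.generate(input_ids, max_new_tokens=512)
# Keep special tokens so the <|thinking|> / <|answer|> markers survive decoding
response = tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=False)

# Split the response into the thinking trace and the real answer
thinking, _, answer = response.partition("<|answer|>")
print("Thinking:", thinking.replace("<|thinking|>", "").strip())
print("Answer:", answer.replace("<|eot_id|>", "").strip())
```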
TODO: include more info here + maybe run some benchmarks? (Please open a discussion if you're interested)