kenhktsui
/

llama3.1-8b-instruct-thinking-sft-merged-gguf

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Uploaded model

Developed by: kenhktsui
License: apache-2.0
Finetuned from model : unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit

This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.

Downloads last month: 83

GGUF

Model size

8.03B params

Architecture

llama

4-bit

Inference API

Unable to determine this model’s pipeline type. Check the docs .

Model tree for kenhktsui/llama3.1-8b-instruct-thinking-sft-merged-gguf

Base model

meta-llama/Llama-3.1-8B

Finetuned

meta-llama/Llama-3.1-8B-Instruct

Quantized

unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit

Quantized

(324)

this model

Collection including kenhktsui/llama3.1-8b-instruct-thinking-sft-merged-gguf

LongTalk

A Very Long Chain-of-Thought Dataset for Reasoning Model Post-Training • 5 items • Updated 4 days ago