Tags: Transformers, GGUF, Not-For-All-Audiences, Inference Endpoints, conversational

QuantFactory/L3.1-8B-sunfall-stheno-v0.6.1-GGUF

This is a quantized version of crestf411/L3.1-8B-sunfall-stheno-v0.6.1, created using llama.cpp.

Original Model Card

Sunfall (2024-07-31) v0.6.1 on top of https://huggingface.co/Sao10K/Llama-3.1-8B-Stheno-v3.4

See https://huggingface.co/crestf411/L3.1-8B-sunfall-v0.6.1-dpo for details on usage.

GGUF
Model size: 8.03B params
Architecture: llama
Available quantizations: 2-bit, 3-bit, 4-bit, 5-bit, 6-bit, 8-bit
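To help pick a quantization level, the parameter count and bit widths listed above can be turned into a rough size estimate. The sketch below is only a back-of-the-envelope calculation: it assumes every weight is stored at exactly the nominal bit width, which is not how llama.cpp quantization formats work in practice (they mix precisions and store per-block scales), so real GGUF files will typically be somewhat larger. The function name `approx_size_gb` is hypothetical and used only for illustration.

```python
# Rough lower-bound estimate of weight storage per quantization level.
# Assumption: every parameter uses exactly the nominal bit width; real GGUF
# files carry extra scale data and some higher-precision tensors.

PARAMS = 8.03e9  # parameter count stated on the model card


def approx_size_gb(bits_per_weight: float, params: float = PARAMS) -> float:
    """Return the approximate weight size in gigabytes (10^9 bytes)."""
    return params * bits_per_weight / 8 / 1e9


# Bit widths listed on the model card.
BIT_WIDTHS = [2, 3, 4, 5, 6, 8]
estimates = {bits: approx_size_gb(bits) for bits in BIT_WIDTHS}

if __name__ == "__main__":
    for bits, gb in estimates.items():
        print(f"{bits}-bit: roughly {gb:.1f} GB or more")
```

As a rule of thumb, the chosen file should fit in available RAM or VRAM with headroom left for the context cache, so the estimate is best read as a minimum rather than a target.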

