---

language:
- en
license: llama3
tags:
- Llama-3
- instruct
- finetune
- chatml
- gpt4
- synthetic data
- distillation
- function calling
- json mode
- axolotl
- roleplaying
- chat
base_model: NousResearch/Hermes-3-Llama-3.1-8B
base_model_relation: quantized
library_name: mlc-llm
pipeline_tag: text-generation
---


A 4-bit [OmniQuant](https://arxiv.org/abs/2308.13137)-quantized version of [Hermes-3-Llama-3.1-8B](https://huggingface.co/NousResearch/Hermes-3-Llama-3.1-8B) for inference with the [Private LLM](http://privatellm.app) app.