README.md · numen-tech/Hermes-3-Llama-3.1-8B-w4a16g128asym at main

Hugging Face

Hermes-3-Llama-3.1-8B-w4a16g128asym / README.md

numen-tech

Add weights

e4a6264 22 days ago

preview code

raw

history blame contribute delete

563 Bytes

metadata

language:
  - en
license: llama3
tags:
  - Llama-3
  - instruct
  - finetune
  - chatml
  - gpt4
  - synthetic data
  - distillation
  - function calling
  - json mode
  - axolotl
  - roleplaying
  - chat
base_model: NousResearch/Hermes-3-Llama-3.1-8B
base_model_relation: quantized
library_name: mlc-llm
pipeline_tag: text-generation

4-bit OmniQuant quantized version of Hermes-3-Llama-3.1-8B for inference with the Private LLM app.