numen-tech's picture
Add weights
e4a6264
metadata
language:
  - en
license: llama3
tags:
  - Llama-3
  - instruct
  - finetune
  - chatml
  - gpt4
  - synthetic data
  - distillation
  - function calling
  - json mode
  - axolotl
  - roleplaying
  - chat
base_model: NousResearch/Hermes-3-Llama-3.1-8B
base_model_relation: quantized
library_name: mlc-llm
pipeline_tag: text-generation

4-bit OmniQuant quantized version of Hermes-3-Llama-3.1-8B for inference with the Private LLM app.