Quantization made by Richard Erkhov.

Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta - GGUF

| Name | Quant method | Size |
| ---- | ---- | ---- |
| Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q2_K.gguf | Q2_K | 2.96GB |
| Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.IQ3_XS.gguf | IQ3_XS | 3.28GB |
| Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.IQ3_S.gguf | IQ3_S | 3.43GB |
| Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q3_K_S.gguf | Q3_K_S | 3.41GB |
| Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.IQ3_M.gguf | IQ3_M | 3.52GB |
| Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q3_K.gguf | Q3_K | 3.74GB |
| Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q3_K_M.gguf | Q3_K_M | 3.74GB |
| Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q3_K_L.gguf | Q3_K_L | 4.03GB |
| Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.IQ4_XS.gguf | IQ4_XS | 4.18GB |
| Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q4_0.gguf | Q4_0 | 4.34GB |
| Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.IQ4_NL.gguf | IQ4_NL | 4.38GB |
| Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q4_K_S.gguf | Q4_K_S | 4.37GB |
| Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q4_K.gguf | Q4_K | 4.58GB |
| Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q4_K_M.gguf | Q4_K_M | 4.58GB |
| Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q4_1.gguf | Q4_1 | 4.78GB |
| Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q5_0.gguf | Q5_0 | 5.21GB |
| Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q5_K_S.gguf | Q5_K_S | 5.21GB |
| Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q5_K.gguf | Q5_K | 5.34GB |
| Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q5_K_M.gguf | Q5_K_M | 5.34GB |
| Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q5_1.gguf | Q5_1 | 5.65GB |
| Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q6_K.gguf | Q6_K | 6.14GB |
| Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q8_0.gguf | Q8_0 | 7.95GB |
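
One way to use the size column is to pick the largest file that fits a memory budget. The sketch below is only an illustration: the sizes are copied from the table above, while the `pick_quant` helper and the 1 GB `headroom_gb` default (a rough allowance for context cache and runtime overhead) are assumptions, not recommendations from the quantizer or the model authors.

```python
# File sizes in GB, copied from the quantization table above.
QUANT_SIZES_GB = {
    "Q2_K": 2.96, "IQ3_XS": 3.28, "IQ3_S": 3.43, "Q3_K_S": 3.41,
    "IQ3_M": 3.52, "Q3_K": 3.74, "Q3_K_M": 3.74, "Q3_K_L": 4.03,
    "IQ4_XS": 4.18, "Q4_0": 4.34, "IQ4_NL": 4.38, "Q4_K_S": 4.37,
    "Q4_K": 4.58, "Q4_K_M": 4.58, "Q4_1": 4.78, "Q5_0": 5.21,
    "Q5_K_S": 5.21, "Q5_K": 5.34, "Q5_K_M": 5.34, "Q5_1": 5.65,
    "Q6_K": 6.14, "Q8_0": 7.95,
}

MODEL_PREFIX = "Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta"


def pick_quant(budget_gb, headroom_gb=1.0):
    """Return the file name of the largest quant that fits, or None.

    headroom_gb is a hypothetical allowance for runtime overhead;
    adjust it for your own context length and backend.
    """
    usable = budget_gb - headroom_gb
    fitting = {q: s for q, s in QUANT_SIZES_GB.items() if s <= usable}
    if not fitting:
        return None
    # When two quants have the same size, the first one listed wins.
    best = max(fitting, key=fitting.get)
    return f"{MODEL_PREFIX}.{best}.gguf"


if __name__ == "__main__":
    for budget in (3.0, 6.0, 8.0):
        print(budget, "GB ->", pick_quant(budget))
```

File size is only a lower bound on memory use, so treat the result as a starting point rather than a guarantee that the model will load.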

Original model description:

language:
- en
widget:
- text: "My name is Julien and I like to"
  example_title: "Julien"
- text: "My name is Merve and my favorite"
  example_title: "Merve"
license: apache-2.0
tags:
- text-generation-inference
- transformers
- unsloth
- llama
- trl
base_model: EpistemeAI2/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math
model-index:
- name: Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta
  results:
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: IFEval (0-Shot)
      type: HuggingFaceH4/ifeval
      args:
        num_few_shot: 0
    metrics:
    - type: inst_level_strict_acc and prompt_level_strict_acc
      value: 72.74
      name: strict accuracy
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=EpistemeAI/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: BBH (3-Shot)
      type: BBH
      args:
        num_few_shot: 3
    metrics:
    - type: acc_norm
      value: 26.9
      name: normalized accuracy
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=EpistemeAI/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: MATH Lvl 5 (4-Shot)
      type: hendrycks/competition_math
      args:
        num_few_shot: 4
    metrics:
    - type: exact_match
      value: 13.22
      name: exact match
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=EpistemeAI/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: GPQA (0-shot)
      type: Idavidrein/gpqa
      args:
        num_few_shot: 0
    metrics:
    - type: acc_norm
      value: 4.03
      name: acc_norm
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=EpistemeAI/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: MuSR (0-shot)
      type: TAUR-Lab/MuSR
      args:
        num_few_shot: 0
    metrics:
    - type: acc_norm
      value: 4.28
      name: acc_norm
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=EpistemeAI/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: MMLU-PRO (5-shot)
      type: TIGER-Lab/MMLU-Pro
      config: main
      split: test
      args:
        num_few_shot: 5
    metrics:
    - type: acc
      value: 28.26
      name: accuracy
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=EpistemeAI/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta
      name: Open LLM Leaderboard

KTO Fine tuning!

A KTO (Kahneman-Tversky Optimization) fine-tune of EpistemeAI2/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math.

Uploaded model

  • Developed by: EpistemeAI2
  • License: apache-2.0
  • Finetuned from model: EpistemeAI2/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math

This Llama model was trained 2x faster with Unsloth and Hugging Face's TRL library.

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

| Metric | Value |
| ---- | ---- |
| Avg. | 24.90 |
| IFEval (0-Shot) | 72.74 |
| BBH (3-Shot) | 26.90 |
| MATH Lvl 5 (4-Shot) | 13.22 |
| GPQA (0-shot) | 4.03 |
| MuSR (0-shot) | 4.28 |
| MMLU-PRO (5-shot) | 28.26 |
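
As a quick sanity check, the "Avg." row can be reproduced as the unweighted mean of the six benchmark scores. This is a minimal sketch that assumes a plain arithmetic mean over the rounded values shown in the table; the leaderboard may average unrounded scores, so the last digit can differ.

```python
# Benchmark scores copied from the results table above.
scores = {
    "IFEval (0-Shot)": 72.74,
    "BBH (3-Shot)": 26.90,
    "MATH Lvl 5 (4-Shot)": 13.22,
    "GPQA (0-shot)": 4.03,
    "MuSR (0-shot)": 4.28,
    "MMLU-PRO (5-shot)": 28.26,
}

# Unweighted mean across the six benchmarks (assumed averaging rule).
average = sum(scores.values()) / len(scores)

if __name__ == "__main__":
    print(f"mean of {len(scores)} benchmarks: {average:.3f}")
```

The mean comes out at roughly 24.905, which is consistent with the reported 24.90 once rounding of the individual scores is taken into account.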

GGUF model size: 8.03B params
Architecture: llama
