Quantization made by Richard Erkhov.

Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta - GGUF

Model creator: https://huggingface.co/EpistemeAI/
Original model: https://huggingface.co/EpistemeAI/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta/

Name	Quant method	Size
Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q2_K.gguf	Q2_K	2.96GB
Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.IQ3_XS.gguf	IQ3_XS	3.28GB
Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.IQ3_S.gguf	IQ3_S	3.43GB
Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q3_K_S.gguf	Q3_K_S	3.41GB
Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.IQ3_M.gguf	IQ3_M	3.52GB
Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q3_K.gguf	Q3_K	3.74GB
Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q3_K_M.gguf	Q3_K_M	3.74GB
Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q3_K_L.gguf	Q3_K_L	4.03GB
Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.IQ4_XS.gguf	IQ4_XS	4.18GB
Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q4_0.gguf	Q4_0	4.34GB
Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.IQ4_NL.gguf	IQ4_NL	4.38GB
Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q4_K_S.gguf	Q4_K_S	4.37GB
Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q4_K.gguf	Q4_K	4.58GB
Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q4_K_M.gguf	Q4_K_M	4.58GB
Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q4_1.gguf	Q4_1	4.78GB
Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q5_0.gguf	Q5_0	5.21GB
Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q5_K_S.gguf	Q5_K_S	5.21GB
Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q5_K.gguf	Q5_K	5.34GB
Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q5_K_M.gguf	Q5_K_M	5.34GB
Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q5_1.gguf	Q5_1	5.65GB
Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q6_K.gguf	Q6_K	6.14GB
Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta.Q8_0.gguf	Q8_0	7.95GB

Original model description:

language:

en widget:
text: "My name is Julien and I like to" example_title: "Julien"
text: "My name is Merve and my favorite" example_title: "Merve"

license: apache-2.0 tags: - text-generation-inference - transformers - unsloth - llama - trl base_model: EpistemeAI2/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math model-index: - name: Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta results: - task: type: text-generation name: Text Generation dataset: name: IFEval (0-Shot) type: HuggingFaceH4/ifeval args: num_few_shot: 0 metrics: - type: inst_level_strict_acc and prompt_level_strict_acc value: 72.74 name: strict accuracy source: url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=EpistemeAI/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta name: Open LLM Leaderboard - task: type: text-generation name: Text Generation dataset: name: BBH (3-Shot) type: BBH args: num_few_shot: 3 metrics: - type: acc_norm value: 26.9 name: normalized accuracy source: url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=EpistemeAI/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta name: Open LLM Leaderboard - task: type: text-generation name: Text Generation dataset: name: MATH Lvl 5 (4-Shot) type: hendrycks/competition_math args: num_few_shot: 4 metrics: - type: exact_match value: 13.22 name: exact match source: url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=EpistemeAI/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta name: Open LLM Leaderboard - task: type: text-generation name: Text Generation dataset: name: GPQA (0-shot) type: Idavidrein/gpqa args: num_few_shot: 0 metrics: - type: acc_norm value: 4.03 name: acc_norm source: url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=EpistemeAI/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta name: Open LLM Leaderboard - task: type: text-generation name: Text Generation dataset: name: MuSR (0-shot) type: TAUR-Lab/MuSR args: num_few_shot: 0 metrics: - type: acc_norm value: 4.28 name: acc_norm source: url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=EpistemeAI/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta name: Open LLM Leaderboard - task: type: text-generation name: Text Generation dataset: name: MMLU-PRO (5-shot) type: TIGER-Lab/MMLU-Pro config: main split: test args: num_few_shot: 5 metrics: - type: acc value: 28.26 name: accuracy source: url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=EpistemeAI/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta name: Open LLM Leaderboard

KTO Fine tuning!

A KTO version EpistemeAI2/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math

Uploaded model

Developed by: EpistemeAI2
License: apache-2.0
Finetuned from model : EpistemeAI2/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math

This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	24.90
IFEval (0-Shot)	72.74
BBH (3-Shot)	26.90
MATH Lvl 5 (4-Shot)	13.22
GPQA (0-shot)	4.03
MuSR (0-shot)	4.28
MMLU-PRO (5-shot)	28.26