PracticeLLM
/

SOLAR-tail-10.7B-Merge-v1.0

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

kyujinpy commited on Dec 26, 2023

Commit

d92f86e

·

1 Parent(s): 138d111

Upload README.md

Files changed (1) hide show

README.md +70 -0

README.md CHANGED Viewed

@@ -1,3 +1,73 @@
 ---
 license: cc-by-nc-sa-4.0
 ---

 ---
+language:
+- en
+- ko
+pipeline_tag: text-generation
 license: cc-by-nc-sa-4.0
 ---
+# **SOLAR-tail-10.7B-Merge-v1.0**
+## Model Details
+**Model Developers** Kyujin Han (kyujinpy)
+**Method**
+Using [Mergekit](https://github.com/cg123/mergekit).
+- [upstage/SOLAR-10.7B-v1.0](https://huggingface.co/upstage/SOLAR-10.7B-v1.0)
+- [Yhyu13/LMCocktail-10.7B-v1](Yhyu13/LMCocktail-10.7B-v1)
+**Merge config**
+```
+slices:
+  - sources:
+      - model: upstage/SOLAR-10.7B-v1.0
+        layer_range: [0, 48]
+      - model: Yhyu13/LMCocktail-10.7B-v1
+        layer_range: [0, 48]
+merge_method: slerp
+base_model: upstage/SOLAR-10.7B-v1.0
+parameters:
+  t:
+    - filter: self_attn
+      value: [0, 0.5, 0.3, 0.7, 1]
+    - filter: mlp
+      value: [1, 0.5, 0.7, 0.3, 0]
+    - value: 0.5 # fallback for rest of tensors
+tokenizer_source: union
+dtype: float16
+```
+# **Model Benchmark**
+## Open leaderboard
+- Follow up as [link](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard).
+| Model | Average | ARC | HellaSwag | MMLU | TruthfulQA | Winogrande | GSM8K |
+| --- | --- | --- | --- | --- | --- | --- | --- |
+| PracticeLLM/SOLAR-tail-10.7B-Merge-v1.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
+| jjourney1125/M-SOLAR-10.7B-v1.0 | 55.15 | 49.57 | 60.12 | 54.60 | 49.23 | 62.22 |
+| beomi/Yi-Ko-6B | 48.79 | 41.04 | 53.39 | 46.28 | 41.64 | 61.63 |
+| mistralai/Mistral-7B-v0.1 | 46.89 | 38.14 | 48.19 | 45.20 | 46.13 | 56.79 |
+# Implementation Code
+```python
+### KO-Platypus
+from transformers import AutoModelForCausalLM, AutoTokenizer
+import torch
+repo = "PracticeLLM/SOLAR-tail-10.7B-Merge-v1.0"
+OpenOrca = AutoModelForCausalLM.from_pretrained(
+        repo,
+        return_dict=True,
+        torch_dtype=torch.float16,
+        device_map='auto'
+)
+OpenOrca_tokenizer = AutoTokenizer.from_pretrained(repo)
+```
+---