MgGPT/MgGPT0-7B

Safetensors

Arabic

llama

jianqing666 committed 19 days ago

Commit 595e28f (verified) · 1 parent: 45c367c

Update README.md

README.md +9 -11

@@ -2,22 +2,20 @@
 license: apache-2.0
 language:
 - ar
-- zh
-- en
 ---
-# <b>MgGPT-7B</b>
-MgGPT-7B is a fully fine-tuned generative text model collection based on LlaMA2, particularly in the
 Arabic language domain. This is the repository for the version of 7B pre-trained model.
 ---
 ## Model Details
-We have released the MgGPT family of large language models, which is a collection of fully fine-tuned generative text models based on LlaMA2(MgGPT-7B, MgGPT-13B), LlaMA3(MgGPT-8B, MgGPT-70B), Qwen2(MgGPT-32B). Our models include two main categories: MgGPT and MgGPT-chat. MgGPT-chat is an optimized version specifically designed for dialogue applications. It is worth mentioning that our models have demonstrated superior performance compared to all currently available open-source Arabic dialogue models in multiple benchmark tests. Furthermore, in our human evaluations, our models have shown comparable satisfaction levels to some closed-source models, such as ChatGPT, in the Arabic language.
 <!-- ## Model Developers
 We are from the King Abdullah University of Science and Technology (KAUST), the Chinese University of Hong Kong, Shenzhen (CUHKSZ), the Shenzhen Research Institute of Big Data (SRIBD), and King AbdulAziz University (KAU). -->
 ## Variations
-MgGPT families come in a range of parameter sizes —— 7B, 8B, 13B, 32B and 70B, each size of model has a base category and a -chat category.
 <!-- ## Paper -->
 <!-- The paper can be accessed at [link](https://huggingface.co/FreedomIntelligence/AceGPT-v1.5-13B-Chat/blob/main/Second_Language_(Arabic)_Acquisition_of_LLMs_via_Progressive_Vocabulary_Expansion.pdf). -->
 ## Input
@@ -28,11 +26,11 @@ Models output text only.
 | Model         | Avg.   | [ArabicMMLU]((https://github.com/mbzuai-nlp/ArabicMMLU)) | [ArabicMMLU]((https://github.com/mbzuai-nlp/ArabicMMLU)) | ARC   | EXAMs | ACVA (clean) | ACVA (all) |
 |---------------|--------|----------------|-----------------------|-------|-------|--------------|------------|
-| **MgGPT-7B**      | 45.19  | 34.03          | 37.00                | 17.49 | 37.28 | 72.69        | 72.67      |
-| MgGPT-8B      | 58.94  | 48.41          | 50.17                | 49.91 | 46.15 | 80.14        | 78.84      |
-| MgGPT-13B     | 52.11  | 40.95          | 47.60                | 31.57 | 35.10 | 79.45        | 78.01      |
-| MgGPT-32B     | 68.75  | 58.71          | 65.67                | 71.69 | 52.74 | 82.66        | 81.04      |
-| MgGPT-70B     | 72.62  | 65.19          | 67.71                | 80.93 | 56.19 | 84.79        | 80.93      |
 | Jais-30B-v3   | 57.02  | 43.42          | 44.47                | 45.56 | 45.70 | 83.39        | 79.51      |
 | GPT-3.5       | 60.71  | 49.07          | 57.70                | 60.24 | 45.93 | 74.45        | 76.88      |
 | GPT-4         | 74.08  | 65.06          | 72.50                | 85.67 | 57.76 | 84.06        | 79.43      |

 license: apache-2.0
 language:
 - ar
 ---
+# <b>MgGPT0-7B</b>
+MgGPT0-7B is a fully fine-tuned generative text model collection based on LlaMA2, particularly in the
 Arabic language domain. This is the repository for the version of 7B pre-trained model.
 ---
 ## Model Details
+We have released the MgGPT0 family of large language models, which is a collection of fully fine-tuned generative text models based on LlaMA2(MgGPT0-7B, MgGPT0-13B), LlaMA3(MgGPT0-8B, MgGPT0-70B), Qwen2(MgGPT0-32B). Our models include two main categories: MgGPT0 and MgGPT0-chat. MgGPT0-chat is an optimized version specifically designed for dialogue applications. It is worth mentioning that our models have demonstrated superior performance compared to all currently available open-source Arabic dialogue models in multiple benchmark tests. Furthermore, in our human evaluations, our models have shown comparable satisfaction levels to some closed-source models, such as ChatGPT, in the Arabic language.
 <!-- ## Model Developers
 We are from the King Abdullah University of Science and Technology (KAUST), the Chinese University of Hong Kong, Shenzhen (CUHKSZ), the Shenzhen Research Institute of Big Data (SRIBD), and King AbdulAziz University (KAU). -->
 ## Variations
+MgGPT0 families come in a range of parameter sizes —— 7B, 8B, 13B, 32B and 70B, each size of model has a base category and a -chat category.
 <!-- ## Paper -->
 <!-- The paper can be accessed at [link](https://huggingface.co/FreedomIntelligence/AceGPT-v1.5-13B-Chat/blob/main/Second_Language_(Arabic)_Acquisition_of_LLMs_via_Progressive_Vocabulary_Expansion.pdf). -->
 ## Input
 | Model         | Avg.   | [ArabicMMLU]((https://github.com/mbzuai-nlp/ArabicMMLU)) | [ArabicMMLU]((https://github.com/mbzuai-nlp/ArabicMMLU)) | ARC   | EXAMs | ACVA (clean) | ACVA (all) |
 |---------------|--------|----------------|-----------------------|-------|-------|--------------|------------|
+| **MgGPT0-7B**      | 45.19  | 34.03          | 37.00                | 17.49 | 37.28 | 72.69        | 72.67      |
+| MgGPT0-8B      | 58.94  | 48.41          | 50.17                | 49.91 | 46.15 | 80.14        | 78.84      |
+| MgGPT0-13B     | 52.11  | 40.95          | 47.60                | 31.57 | 35.10 | 79.45        | 78.01      |
+| MgGPT0-32B     | 68.75  | 58.71          | 65.67                | 71.69 | 52.74 | 82.66        | 81.04      |
+| MgGPT0-70B     | 72.62  | 65.19          | 67.71                | 80.93 | 56.19 | 84.79        | 80.93      |
 | Jais-30B-v3   | 57.02  | 43.42          | 44.47                | 45.56 | 45.70 | 83.39        | 79.51      |
 | GPT-3.5       | 60.71  | 49.07          | 57.70                | 60.24 | 45.93 | 74.45        | 76.88      |
 | GPT-4         | 74.08  | 65.06          | 72.50                | 85.67 | 57.76 | 84.06        | 79.43      |
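
The README does not say how the `Avg.` column is computed. A quick check suggests it is the unweighted mean of the six benchmark columns. This is an inference from the numbers, not something the source confirms. The sketch below recomputes the mean for every row; the names `ROWS`, `recomputed_average`, and `check_averages` are hypothetical, and the 0.02 tolerance is an assumption meant to absorb rounding of the published scores.

```python
# Scores copied from the evaluation table above:
# model -> (reported Avg., [six benchmark scores in column order]).
ROWS = {
    "MgGPT0-7B":   (45.19, [34.03, 37.00, 17.49, 37.28, 72.69, 72.67]),
    "MgGPT0-8B":   (58.94, [48.41, 50.17, 49.91, 46.15, 80.14, 78.84]),
    "MgGPT0-13B":  (52.11, [40.95, 47.60, 31.57, 35.10, 79.45, 78.01]),
    "MgGPT0-32B":  (68.75, [58.71, 65.67, 71.69, 52.74, 82.66, 81.04]),
    "MgGPT0-70B":  (72.62, [65.19, 67.71, 80.93, 56.19, 84.79, 80.93]),
    "Jais-30B-v3": (57.02, [43.42, 44.47, 45.56, 45.70, 83.39, 79.51]),
    "GPT-3.5":     (60.71, [49.07, 57.70, 60.24, 45.93, 74.45, 76.88]),
    "GPT-4":       (74.08, [65.06, 72.50, 85.67, 57.76, 84.06, 79.43]),
}


def recomputed_average(scores):
    """Unweighted mean of the benchmark scores."""
    return sum(scores) / len(scores)


def check_averages(rows, tol=0.02):
    """Return {model: (reported, recomputed, within_tolerance)}.

    tol is an assumed allowance for rounding in the published table.
    """
    result = {}
    for model, (reported, scores) in rows.items():
        mean = recomputed_average(scores)
        result[model] = (reported, mean, abs(mean - reported) <= tol)
    return result


if __name__ == "__main__":
    for model, (reported, mean, ok) in check_averages(ROWS).items():
        print(f"{model:12s} reported={reported:.2f} recomputed={mean:.2f} ok={ok}")
```

If a row fell outside the tolerance, that would point to either a transcription error in the table or a different averaging scheme (for example, weighting by benchmark size).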