nymtheescobar commited on
Commit
97e27c3
·
verified ·
1 Parent(s): f959f10

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +12 -8
README.md CHANGED
@@ -3,9 +3,9 @@ language:
3
  - bn
4
  - en
5
  license: llama3
6
- base_model: meta-llama/Meta-Llama-3.1-8B
 
7
  datasets:
8
- - uonlp/CulturaX
9
  - BanglaLLM/bangla-alpaca-orca
10
  tags:
11
  - bangla
@@ -16,16 +16,14 @@ tags:
16
  - Transformer
17
  ---
18
 
19
- # Bangla LLaMA-3 8B Bangla2B Instruct v0.1 [finetune]
20
 
21
- Welcome to the inaugural release of the Bangla LLaMA-3 8B unolp-culturax instruct model – an important step in advancing LLMs for the Bangla language. This model is ready for immediate inference.
22
 
23
  > **Please Note:** This model, labeled as a foundational Bangla Language Model (LLM), is designed primarily for Causal Language Modeling (LM) purposes.
24
 
25
  ## Model description
26
 
27
- The Bangla LLaMA models have been enhanced and tailored specifically with an extensive Bangla vocabulary of 16,000 tokens, building upon the foundation set by the original LLaMA-3.
28
-
29
  - **Model type:** A 8B parameter model for Causal LM pre-trained on unolp/culturax dataset and then instruct finetuned with BanglaLLM/bangla-alpaca-orca.
30
  - **Language(s):** Bangla and English
31
  - **License:** GNU General Public License v3.0
@@ -40,8 +38,14 @@ The Bangla LLaMA models have been enhanced and tailored specifically with an ext
40
  | Bangla LLaMA 7B Base | Base model | 12GB | LLaMA 7B | 7B | [HF Hub](https://huggingface.co/BanglaLLM/bangla-llama-7b-base-v0.1) |
41
  | Bangla LLaMA 13B Base | Base model | 4GB | LLaMA 13B | 13B | [HF Hub](https://huggingface.co/BanglaLLM/bangla-llama-13b-base-v0.1) |
42
  | Bangla LLaMA 7B Instruct | Instruction following model | 145k instructions | Bangla LLaMA 7B Base | 7B | [HF Hub](https://huggingface.co/BanglaLLM/bangla-llama-7b-instruct-v0.1) |
43
- | Bangla LLaMA 13B Instruct | Instruction following model | 145k instructions | Bangla LLaMA 13B Base | 13B | [HF Hub](BanglaLLM/bangla-llama-13b-instruct-v0.1) |
44
-
 
 
 
 
 
 
45
  ## Usage Note
46
 
47
  It's important to note that the models have not undergone detoxification. Therefore, while they possess impressive linguistic capabilities, there is a possibility for them to generate content that could be deemed harmful or offensive. We urge users to exercise discretion and supervise the model's outputs closely, especially in public or sensitive applications.
 
3
  - bn
4
  - en
5
  license: llama3
6
+ base_model:
7
+ - BanglaLLM/BanglaLLama-3-8b-unolp-culturax-base-v0.0.1
8
  datasets:
 
9
  - BanglaLLM/bangla-alpaca-orca
10
  tags:
11
  - bangla
 
16
  - Transformer
17
  ---
18
 
19
+ # Bangla LLaMA-3 8B bangla-alpaca-orca base v0.1 [pretrained]
20
 
21
+ Welcome to the inaugural release of the Bangla LLaMA-3 8B unolp-culturax base model – an important step in advancing LLMs for the Bangla language. This model is ready for immediate inference.
22
 
23
  > **Please Note:** This model, labeled as a foundational Bangla Language Model (LLM), is designed primarily for Causal Language Modeling (LM) purposes.
24
 
25
  ## Model description
26
 
 
 
27
  - **Model type:** A 8B parameter model for Causal LM pre-trained on unolp/culturax dataset and then instruct finetuned with BanglaLLM/bangla-alpaca-orca.
28
  - **Language(s):** Bangla and English
29
  - **License:** GNU General Public License v3.0
 
38
  | Bangla LLaMA 7B Base | Base model | 12GB | LLaMA 7B | 7B | [HF Hub](https://huggingface.co/BanglaLLM/bangla-llama-7b-base-v0.1) |
39
  | Bangla LLaMA 13B Base | Base model | 4GB | LLaMA 13B | 13B | [HF Hub](https://huggingface.co/BanglaLLM/bangla-llama-13b-base-v0.1) |
40
  | Bangla LLaMA 7B Instruct | Instruction following model | 145k instructions | Bangla LLaMA 7B Base | 7B | [HF Hub](https://huggingface.co/BanglaLLM/bangla-llama-7b-instruct-v0.1) |
41
+ | Bangla LLaMA 13B Instruct | Instruction following model | 145k instructions | Bangla LLaMA 13B Base | 13B | [HF Hub](https://huggingface.co/BanglaLLM/bangla-llama-13b-instruct-v0.1) |
42
+ | Bangla LLaMA 3 8B Base | Base model | 12.4M | LLaMA 3 8b | 8B | [HF Hub](https://huggingface.co/BanglaLLM/BanglaLLama-3-8b-unolp-culturax-base-v0.0.1)
43
+ | Bangla LLaMA 3 8B Instruct | Instruction following model | 172k instructions | Bangla LLaMA 3 8B Base | 8B | [HF Hub](https://huggingface.co/BanglaLLM/BanglaLLama-3-8b-bangla-alpaca-orca-instruct-v0.0.1)
44
+ | Bangla LLaMA 3.1 8B Base | Base model | 12.4M | LLaMA 3.1 8b | 8B | [HF Hub](https://huggingface.co/BanglaLLM/BanglaLLama-3.1-8b-unolp-culturax-base-v0.0.1)
45
+ | Bangla LLaMA 3.1 8B Instruct | Instruction following model | 172k instructions | Bangla LLaMA 3.1 8B Base | 8b | [HF Hub](https://huggingface.co/BanglaLLM/BanglaLLama-3.1-8b-bangla-alpaca-orca-instruct-v0.0.1)
46
+ | Bangla LLaMA 3.2 1B Base | Base model | 12.4M | LLaMA 3.2 1b | 1b | [HF Hub](https://huggingface.co/BanglaLLM/BanglaLLama-3.2-1b-unolp-culturax-base-v0.0.1)
47
+ | Bangla LLaMA 3.2 1B Instruct | Instruction following model | 172k instructions | Bangla LLaMA 3.2 1B Base | 1b | [HF Hub](https://huggingface.co/BanglaLLM/BanglaLLama-3.2-1b-bangla-alpaca-orca-instruct-v0.0.1)
48
+ | Bangla LLaMA 3.2 3B Instruct| Instruction following model | 172k instructions | Bangla LLaMA 3.2 3B Base | 3B | [HF Hub](https://huggingface.co/BanglaLLM/BanglaLLama-3.2-3b-bangla-alpaca-orca-instruct-v0.0.1)
49
  ## Usage Note
50
 
51
  It's important to note that the models have not undergone detoxification. Therefore, while they possess impressive linguistic capabilities, there is a possibility for them to generate content that could be deemed harmful or offensive. We urge users to exercise discretion and supervise the model's outputs closely, especially in public or sensitive applications.