weiqipedia committed on
Commit
fee5443
·
1 Parent(s): 3fd7468

Minor fixes for README.md

1. Minor language tweaks
2. Minor tweaks to names

Files changed (1)
  1. README.md +9 -10
README.md CHANGED
@@ -3,24 +3,23 @@ license: mit
 ---
 # SEA-LION
 
-SEA-LION is a collection of LLMs which has been pretrained and instruct-tuned for the Southeast Asia (SEA) region.
-The models range from 3 billion to 7 billion parameters.
+SEA-LION is a collection of Large Language Models (LLMs) which has been pretrained and instruct-tuned for the Southeast Asia (SEA) region.
+The size of the models range from 3 billion to 7 billion parameters.
 This is the card for the SEA-LION 3B model.
 
-SEA-LION stands for <i>Southeast Asia Languages In One Network</i>.
+SEA-LION stands for <i>Southeast Asian Languages In One Network</i>.
 
 
 ## Model Details
 
 ### Model Description
 
-The SEA-LION model is a significant leap forward in the field of natural language processing,
-specifically trained to understand Southeast Asia (SEA) regional context.
+The SEA-LION model is a significant leap forward in the field of Natural Language Processing,
+specifically trained to understand the SEA regional context.
 
-SEA-LION is built on the robust MPT architecture and utilize a vocabulary size of 256K.
+SEA-LION is built on the robust MPT architecture and has a vocabulary size of 256K.
 
-The model employs our custom SEABPETokenizer for tokenization.
-Our SEABPETokenizer is specially tailored for SEA languages, ensuring optimal model performance.
+For tokenization, the model employs our custom SEABPETokenizer, which is specially tailored for SEA languages, ensuring optimal model performance.
 
 The training data for SEA-LION encompasses 980B tokens.
 
@@ -108,14 +107,14 @@ The tokenizer type is Byte-Pair Encoding (BPE).
 ## The Team
 
 Lam Zhiwen Clarence<br>
-Leong Weiqi<br>
+Leong Wei Qi<br>
 Li Yier<br>
 Liu Darius<br>
 Lovenia Holy<br>
 Montalan Jann Railey<br>
 Ng Raymond<br>
 Ngui Jian Gang<br>
-Nguyen Ngan Thanh<br>
+Nguyen Thanh Ngan<br>
 Ong Tat-Wee David<br>
 Rengarajan Hamsawardhini<br>
 Susanto Yosephine<br>