tainc commited on
Commit
eb86eb6
·
verified ·
1 Parent(s): 7ec12eb

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -35
README.md CHANGED
@@ -1,4 +1,5 @@
1
  ---
 
2
  base_model:
3
  - aisingapore/llama3-8b-cpt-sea-lionv2-base
4
  language:
@@ -10,7 +11,6 @@ language:
10
  license: llama3
11
  ---
12
  # Llama3 8B CPT SEA-Lionv2.1 Instruct
13
-
14
  SEA-LION is a collection of Large Language Models (LLMs) which has been pretrained and instruct-tuned for the Southeast Asia (SEA) region.
15
 
16
  Llama3 8B CPT SEA-Lionv2.1 Instruct is a multilingual model which has been fine-tuned with around **100,000 English instruction-completion pairs** alongside a smaller pool of around **50,000 instruction-completion pairs** from other ASEAN languages, such as Indonesian, Thai and Vietnamese.
@@ -23,7 +23,7 @@ SEA-LION stands for _Southeast Asian Languages In One Network_.
23
  - **Developed by:** Products Pillar, AI Singapore
24
  - **Funded by:** Singapore NRF
25
  - **Model type:** Decoder
26
- - **Languages:** English, Indonesian, Thai, Vietnamese, Tamil
27
  - **License:** [Llama3 Community License](https://huggingface.co/meta-llama/Meta-Llama-3-8B/blob/main/LICENSE)
28
 
29
  ## Model Details
@@ -123,7 +123,6 @@ It is important for users to be aware that our model exhibits certain limitation
123
 
124
  ## Limitations
125
  ### Safety
126
-
127
  Current SEA-LION models, including this commercially permissive release, have not been aligned for safety. Developers and users should perform their own safety fine-tuning and related security measures. In no event shall the authors be held liable for any claim, damages, or other liability arising from the use of the released weights and codes.
128
 
129
  ## Technical Specifications
@@ -141,48 +140,17 @@ Link to dataset: _coming soon_
141
  We encourage researchers, developers, and language enthusiasts to actively contribute to the enhancement and expansion of SEA-LION. Contributions can involve identifying and reporting bugs, sharing pre-training, instruction, and preference data, improving documentation usability, proposing and implementing new model evaluation tasks and metrics, or training versions of the model in additional Southeast Asian languages. Join us in shaping the future of SEA-LION by sharing your expertise and insights to make these models more accessible, accurate, and versatile. Please check out our GitHub for further information on the call for contributions.
142
 
143
  ## The Team
144
-
145
- Choa Esther<br>
146
- Cheng Nicholas<br>
147
- Huang Yuli<br>
148
- Lau Wayne<br>
149
- Lee Chwan Ren<br>
150
- Leong Wai Yi<br>
151
- Leong Wei Qi<br>
152
- Li Yier<br>
153
- Liu Bing Jie Darius<br>
154
- Lovenia Holy<br>
155
- Montalan Jann Railey<br>
156
- Ng Boon Cheong Raymond<br>
157
- Ngui Jian Gang<br>
158
- Nguyen Thanh Ngan<br>
159
- Ong Brandon<br>
160
- Ong Tat-Wee David<br>
161
- Ong Zhi Hao<br>
162
- Rengarajan Hamsawardhini<br>
163
- Siow Bryan<br>
164
- Susanto Yosephine<br>
165
- Tai Ngee Chia<br>
166
- Tan Choon Meng<br>
167
- Teo Eng Sipp Leslie<br>
168
- Teo Wei Yi<br>
169
- Tjhi William<br>
170
- Teng Walter<br>
171
- Yeo Yeow Tong<br>
172
- Yong Xianbin<br>
173
 
174
  ## Acknowledgements
175
-
176
  [AI Singapore](​​https://aisingapore.org/) is a national programme supported by the National Research Foundation, Singapore and hosted by the National University of Singapore. Any opinions, findings and conclusions or recommendations expressed in this material are those of the author(s) and do not reflect the views of the National Research Foundation or the National University of Singapore.
177
 
178
  ## Contact
179
-
180
  For more info, please contact us using this [SEA-LION Inquiry Form](https://forms.gle/sLCUVb95wmGf43hi6)
181
 
182
  [Link to SEA-LION's GitHub repository](https://github.com/aisingapore/sealion)
183
 
184
  ## Disclaimer
185
-
186
  This is the repository for the commercial instruction-tuned model.
187
  The model has _not_ been aligned for safety.
188
  Developers and users should perform their own safety fine-tuning and related security measures.
 
1
  ---
2
+ new_version: aisingapore/llama3.1-8b-cpt-sea-lionv3-instruct
3
  base_model:
4
  - aisingapore/llama3-8b-cpt-sea-lionv2-base
5
  language:
 
11
  license: llama3
12
  ---
13
  # Llama3 8B CPT SEA-Lionv2.1 Instruct
 
14
  SEA-LION is a collection of Large Language Models (LLMs) which has been pretrained and instruct-tuned for the Southeast Asia (SEA) region.
15
 
16
  Llama3 8B CPT SEA-Lionv2.1 Instruct is a multilingual model which has been fine-tuned with around **100,000 English instruction-completion pairs** alongside a smaller pool of around **50,000 instruction-completion pairs** from other ASEAN languages, such as Indonesian, Thai and Vietnamese.
 
23
  - **Developed by:** Products Pillar, AI Singapore
24
  - **Funded by:** Singapore NRF
25
  - **Model type:** Decoder
26
+ - **Languages supported:** English, Indonesian, Thai, Vietnamese, Tamil
27
  - **License:** [Llama3 Community License](https://huggingface.co/meta-llama/Meta-Llama-3-8B/blob/main/LICENSE)
28
 
29
  ## Model Details
 
123
 
124
  ## Limitations
125
  ### Safety
 
126
  Current SEA-LION models, including this commercially permissive release, have not been aligned for safety. Developers and users should perform their own safety fine-tuning and related security measures. In no event shall the authors be held liable for any claim, damages, or other liability arising from the use of the released weights and codes.
127
 
128
  ## Technical Specifications
 
140
  We encourage researchers, developers, and language enthusiasts to actively contribute to the enhancement and expansion of SEA-LION. Contributions can involve identifying and reporting bugs, sharing pre-training, instruction, and preference data, improving documentation usability, proposing and implementing new model evaluation tasks and metrics, or training versions of the model in additional Southeast Asian languages. Join us in shaping the future of SEA-LION by sharing your expertise and insights to make these models more accessible, accurate, and versatile. Please check out our GitHub for further information on the call for contributions.
141
 
142
  ## The Team
143
+ Cheng Nicholas, Choa Esther, Huang Yuli, Lau Wayne, Lee Chwan Ren, Leong Wai Yi, Leong Wei Qi, Li Yier, Liu Bing Jie Darius, Lovenia Holy, Montalan Jann Railey, Ng Boon Cheong Raymond, Ngui Jian Gang, Nguyen Thanh Ngan, Ong Brandon, Ong Tat-Wee David, Ong Zhi Hao, Rengarajan Hamsawardhini, Siow Bryan, Susanto Yosephine, Tai Ngee Chia, Tan Choon Meng, Teo Eng Sipp Leslie, Teo Wei Yi, Tjhi William, Teng Walter, Yeo Yeow Tong, Yong Xianbin
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
144
 
145
  ## Acknowledgements
 
146
  [AI Singapore](​​https://aisingapore.org/) is a national programme supported by the National Research Foundation, Singapore and hosted by the National University of Singapore. Any opinions, findings and conclusions or recommendations expressed in this material are those of the author(s) and do not reflect the views of the National Research Foundation or the National University of Singapore.
147
 
148
  ## Contact
 
149
  For more info, please contact us using this [SEA-LION Inquiry Form](https://forms.gle/sLCUVb95wmGf43hi6)
150
 
151
  [Link to SEA-LION's GitHub repository](https://github.com/aisingapore/sealion)
152
 
153
  ## Disclaimer
 
154
  This is the repository for the commercial instruction-tuned model.
155
  The model has _not_ been aligned for safety.
156
  Developers and users should perform their own safety fine-tuning and related security measures.