Bllossom
/

llama-3.2-Korean-Bllossom-AICA-5B

@@ -5,22 +5,22 @@ tags: []
 <a href="https://github.com/MLP-Lab/Bllossom">
-  <img src="https://raw.githubusercontent.com/teddysum/bllossom/main/bllossom_icon.png?token=GHSAT0AAAAAACZIELMFYS74LTHEVHXKCYQMZ2SUOVQ" width="30%" height="30%">
 </a>
 # Update!
-* [2024.12.06] -
-# Bllossom | [Demo]() | [Homepage](https://www.bllossom.ai/) | [Github](https://github.com/MLP-Lab/Bllossom) |
 ```bash
-저희 Bllossom 팀에서 llama3.2-3B 기반의 한국어-영어 언어모델 Bllossom-AICA 공개합니다.
 이번 Bllossom-AICA는 다음과 같은 특징을 보입니다.
- - 일반 언어모델, 시각-언어모델 양방향으로 활용이 가능합니다.
  - 이미지를 넣으면 시각-언어모델, 넣지 않으면 언어모델로 작동하며 시각-언어, 그냥 언어모델 양방향모두 학습 및 추론이 가능합니다.
- - 시각 정보의 이해를 바탕으로 언어모델의 성능이 대폭 향상되었습니다. (정성평가 기준 Bllossom-3.2-3B모델 대비 10%이상)
- - 영어 성능을 전혀 손상시키지 않은 완전한 Bilingual 모델입니다.
  - 한국어 OCR, 표, 그래프 해석에 최적화 되어있습니다.
  - 외부지식에 대한 선택적 추론 기능이 학습되었습니다. RAG를 활용할 때 질문과 관련 없는 오류가 섞인 정보의 경우 모델 스스로 활용하지 않습니다.
@@ -62,9 +62,6 @@ We, the Bllossom team, are pleased to announce the release of Bllossom-Vision, a
 ## Example code
-### Colab Tutorial
- - [Inference-Code-Link](Inference code coming soon)
 ### Python code (Use Vision-language Model)
 ```python
 from transformers import MllamaForConditionalGeneration,MllamaProcessor
@@ -73,11 +70,11 @@ from PIL import Image
 import requests
 model = MllamaForConditionalGeneration.from_pretrained(
-  'Bllossom/llama-3.2-Korean-Bllossom-AICA-5.2B',
   torch_dtype=torch.bfloat16,
   device_map='auto'
 )
-processor = MllamaProcessor.from_pretrained('Bllossom/llama-3.2-Korean-Bllossom-AICA-5.2B')
 url = "https://t1.daumcdn.net/cfile/tistory/21527E4A543DCABE1D"
 image = Image.open(requests.get(url, stream=True).raw)
@@ -110,11 +107,11 @@ from PIL import Image
 import requests
 model = MllamaForConditionalGeneration.from_pretrained(
-  'Bllossom/llama-3.2-Korean-Bllossom-AICA-5.2B',
   torch_dtype=torch.bfloat16,
   device_map='auto'
 )
-processor = MllamaProcessor.from_pretrained('Bllossom/llama-3.2-Korean-Bllossom-AICA-5.2B')
 url = "https://cdn.discordapp.com/attachments/1156141391798345742/1313407928287494164/E18489E185B3E1848FE185B3E18485E185B5E186ABE18489E185A3E186BA202021-11-1620E1848BE185A9E18492E185AE2011.png?ex=675005f4&is=674eb474&hm=fc9c4231203f53c27f6edd2420961c182dd4a1ed14d4b73e04127f11393729af&"
 image = Image.open(requests.get(url, stream=True).raw)
@@ -142,22 +139,21 @@ print(processor.decode(output[0]))
 ## Supported by
  - AICA  <img src="https://aica-gj.kr/images/logo.png" width="20%" height="20%">
- - 유클리드소프트 <img src="https://euclidsoft.co.kr/_next/image?url=%2Fimg%2Flogo.png&w=384&q=75" width="20%" height="20%">
 ## Citation
-**Language Model**
 ```text
-@misc{bllossom,
-  author = {ChangSu Choi, Yongbin Jeong, Seoyoon Park, InHo Won, HyeonSeok Lim, SangMin Kim, Yejee Kang, Chanhyuk Yoon, Jaewan Park, Yiseul Lee, HyeJin Lee, Younggyun Hahm, Hansaem Kim, KyungTae Lim},
-  title = {Optimizing Language Augmentation for Multilingual Large Language Models: A Case Study on Korean},
-  year = {2024},
-  journal = {LREC-COLING 2024},
-  paperLink = {\url{https://arxiv.org/pdf/2403.10882}},
  },
 }
 ```
-**Vision-Language Model**
 ```text
 @misc{bllossom-V,
   author = {Dongjae Shin, Hyunseok Lim, Inho Won, Changsu Choi, Minjun Kim, Seungwoo Song, Hangyeol Yoo, Sangmin Kim, Kyungtae Lim},
@@ -169,6 +165,17 @@ print(processor.decode(output[0]))
  },
 }
 ```
 ## Contact
  - 임경태(KyungTae Lim), Professor at Seoultech. `[email protected]`
@@ -177,11 +184,5 @@ print(processor.decode(output[0]))
 ## Contributor
  - **신동재(Dongjae Shin)**, [email protected]
- - **임현석(Hyeonseok Lim)**, gustjrantk@seoultech.ac.kr
- - 원인호(Inho Won), wih1226@seoultech.ac.kr
- - 김민준(Minjun Kim), [email protected]
- - 유한결(Hangyeol Yoo), [email protected]
- - 송승우(Seungwoo Song), [email protected]
- - 육정훈(Jeonghun Yuk), [email protected]
- - 최창수(Chansu Choi), [email protected]
- - 송서현(Seohyun Song), [email protected]

 <a href="https://github.com/MLP-Lab/Bllossom">
+  <img src="https://img.newspim.com/news/2024/08/07/2408070813597620.jpg" width="30%" height="30%">
 </a>
 # Update!
+* [2024.12.12] 추가설명: 저희는 KMMLU, KoBEST, LogicKor 등 벤치 관련 학습/테스트/유사 데이터를 전혀 사용하지 않았습니다. 저런거 증강해가 쓰까서 학습하면 SOTA 성능 근접하게 나옵니다 모델위에 해보세요!
+* [2024.12.06] Bllossom-5B 모델 최초 업데이트!
+# Bllossom [Inference-Code-Link](https://drive.google.com/file/d/1AoxfoV0TSN-pGdc9fa3dRv3-NLZknHlJ/view?usp=sharing) [Tuning-Code-Link](https://drive.google.com/file/d/1AoxfoV0TSN-pGdc9fa3dRv3-NLZknHlJ/view?usp=sharing)
 ```bash
+저희 Bllossom 팀에서 llama3.2-3B 기반의 한국어-영어 언어모델 Bllossom-AICA-5B를 공개합니다.
 이번 Bllossom-AICA는 다음과 같은 특징을 보입니다.
+ - 일반 언어모델, 시각-언어모델 양방향으로 활용이 가능한 최초의 llama기반 3B확장 모델입니다. (코랩 무료 GPU에서 사용가능한 유일한 한국어)
  - 이미지를 넣으면 시각-언어모델, 넣지 않으면 언어모델로 작동하며 시각-언어, 그냥 언어모델 양방향모두 학습 및 추론이 가능합니다.
+ - 시각 정보의 이해를 바탕으로 언어모델의 성능이 대폭 향상되었습니다. (정성평가 기준 Bllossom-3.2-3B모델 대비 15%이상)
  - 한국어 OCR, 표, 그래프 해석에 최적화 되어있습니다.
  - 외부지식에 대한 선택적 추론 기능이 학습되었습니다. RAG를 활용할 때 질문과 관련 없는 오류가 섞인 정보의 경우 모델 스스로 활용하지 않습니다.
 ## Example code
 ### Python code (Use Vision-language Model)
 ```python
 from transformers import MllamaForConditionalGeneration,MllamaProcessor
 import requests
 model = MllamaForConditionalGeneration.from_pretrained(
+  'Bllossom/llama-3.2-Korean-Bllossom-AICA-5B',
   torch_dtype=torch.bfloat16,
   device_map='auto'
 )
+processor = MllamaProcessor.from_pretrained('Bllossom/llama-3.2-Korean-Bllossom-AICA-5B')
 url = "https://t1.daumcdn.net/cfile/tistory/21527E4A543DCABE1D"
 image = Image.open(requests.get(url, stream=True).raw)
 import requests
 model = MllamaForConditionalGeneration.from_pretrained(
+  'Bllossom/llama-3.2-Korean-Bllossom-AICA-5B',
   torch_dtype=torch.bfloat16,
   device_map='auto'
 )
+processor = MllamaProcessor.from_pretrained('Bllossom/llama-3.2-Korean-Bllossom-AICA-5B')
 url = "https://cdn.discordapp.com/attachments/1156141391798345742/1313407928287494164/E18489E185B3E1848FE185B3E18485E185B5E186ABE18489E185A3E186BA202021-11-1620E1848BE185A9E18492E185AE2011.png?ex=675005f4&is=674eb474&hm=fc9c4231203f53c27f6edd2420961c182dd4a1ed14d4b73e04127f11393729af&"
 image = Image.open(requests.get(url, stream=True).raw)
 ## Supported by
  - AICA  <img src="https://aica-gj.kr/images/logo.png" width="20%" height="20%">
 ## Citation
+**Vision-Language Model**
 ```text
+@misc{VLR-Bench,
+  author = {Hyeonseok Lim, Dongjae Shin, Seohyun Song, Inho Won, Minjun Kim, Junghun Yuk, Hangyeol Yoo, Haneol Jang, Kyungtae Lim},
+  title = {VLR-Bench: Multilingual Benchmark Dataset for Vision-Language Retrieval Augmented Generation},
+  year = {2025},
+  publisher = {GitHub},
+  journal = {COLING 2025},
  },
 }
 ```
 ```text
 @misc{bllossom-V,
   author = {Dongjae Shin, Hyunseok Lim, Inho Won, Changsu Choi, Minjun Kim, Seungwoo Song, Hangyeol Yoo, Sangmin Kim, Kyungtae Lim},
  },
 }
 ```
+**Language Model**
+```text
+@misc{bllossom,
+  author = {ChangSu Choi, Yongbin Jeong, Seoyoon Park, InHo Won, HyeonSeok Lim, SangMin Kim, Yejee Kang, Chanhyuk Yoon, Jaewan Park, Yiseul Lee, HyeJin Lee, Younggyun Hahm, Hansaem Kim, KyungTae Lim},
+  title = {Optimizing Language Augmentation for Multilingual Large Language Models: A Case Study on Korean},
+  year = {2024},
+  journal = {LREC-COLING 2024},
+  paperLink = {\url{https://arxiv.org/pdf/2403.10882}},
+ },
+}
+```
 ## Contact
  - 임경태(KyungTae Lim), Professor at Seoultech. `[email protected]`
 ## Contributor
  - **신동재(Dongjae Shin)**, [email protected]
+ - **유한결(Hangyeol Yoo)**, hgyoo@seoultech.ac.kr
+ - **임현석(Hyeonseok Lim)**, gustjrantk@seoultech.ac.kr