brian-lim
/

smile-style-transfer

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

brian-lim commited on Dec 21, 2023

Commit

be423b2

·

1 Parent(s): 4d6a9e2

Update README.md

Files changed (1) hide show

README.md +29 -0

README.md CHANGED Viewed

@@ -1,3 +1,32 @@
 ---
 license: apache-2.0
 ---

 ---
 license: apache-2.0
+datasets:
+- brian-lim/smile_style_orca
+language:
+- ko
 ---
+# Korean Style Transfer
+This model is a fine-tuned version of [Synatra-7B-v0.3-dpo](https://huggingface.co/maywell/Synatra-7B-v0.3-dpo) using a Korean style dataset provided by Smilegate AI (https://github.com/smilegate-ai/korean_smile_style_dataset/tree/main).
+Since the original dataset is tabular and not fit for training the LLM, I have preprocessed it into instruction-input-output format, which can be found (here)[https://huggingface.co/datasets/brian-lim/smile_style_orca].
+The dataset is then fed into the ChatML template. Feel free to use my version of the dataset as needed.
+해당 모델은 [Synatra-7B-v0.3-dpo](https://huggingface.co/maywell/Synatra-7B-v0.3-dpo) 모델을 스마일게이트 AI에서 제공하는 Smile style 데이터셋으로 파인튜닝 했습니다.
+기존 데이터셋은 테이블 형태로 되어있어 해당 데이터를 instruction-input-output 형태로 만들었고, (여기)[https://huggingface.co/datasets/brian-lim/smile_style_orca]에서 확인 가능합니다.
+데이터셋을 불러온 뒤 ChatML 형식에 맞춰 훈련 데이터 구축을 한 뒤 진행했습니다. 필요하시다면 자유롭게 사용하시기 바랍니다.
+# Intended use & limitations
+To be added
+추가 예정
+# How to use
+To be added
+추가예정
+---
+license: apache-2.0
+---