Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,32 @@
|
|
1 |
---
|
2 |
license: apache-2.0
|
|
|
|
|
|
|
|
|
3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
license: apache-2.0
|
3 |
+
datasets:
|
4 |
+
- brian-lim/smile_style_orca
|
5 |
+
language:
|
6 |
+
- ko
|
7 |
---
|
8 |
+
# Korean Style Transfer
|
9 |
+
|
10 |
+
This model is a fine-tuned version of [Synatra-7B-v0.3-dpo](https://huggingface.co/maywell/Synatra-7B-v0.3-dpo) using a Korean style dataset provided by Smilegate AI (https://github.com/smilegate-ai/korean_smile_style_dataset/tree/main).
|
11 |
+
Since the original dataset is tabular and not fit for training the LLM, I have preprocessed it into instruction-input-output format, which can be found (here)[https://huggingface.co/datasets/brian-lim/smile_style_orca].
|
12 |
+
The dataset is then fed into the ChatML template. Feel free to use my version of the dataset as needed.
|
13 |
+
|
14 |
+
ν΄λΉ λͺ¨λΈμ [Synatra-7B-v0.3-dpo](https://huggingface.co/maywell/Synatra-7B-v0.3-dpo) λͺ¨λΈμ μ€λ§μΌκ²μ΄νΈ AIμμ μ 곡νλ Smile style λ°μ΄ν°μ
μΌλ‘ νμΈνλ νμ΅λλ€.
|
15 |
+
κΈ°μ‘΄ λ°μ΄ν°μ
μ ν
μ΄λΈ ννλ‘ λμ΄μμ΄ ν΄λΉ λ°μ΄ν°λ₯Ό instruction-input-output ννλ‘ λ§λ€μκ³ , (μ¬κΈ°)[https://huggingface.co/datasets/brian-lim/smile_style_orca]μμ νμΈ κ°λ₯ν©λλ€.
|
16 |
+
λ°μ΄ν°μ
μ λΆλ¬μ¨ λ€ ChatML νμμ λ§μΆ° νλ ¨ λ°μ΄ν° ꡬμΆμ ν λ€ μ§ννμ΅λλ€. νμνμλ€λ©΄ μμ λ‘κ² μ¬μ©νμκΈ° λ°λλλ€.
|
17 |
+
|
18 |
+
# Intended use & limitations
|
19 |
+
|
20 |
+
To be added
|
21 |
+
|
22 |
+
μΆκ° μμ
|
23 |
+
|
24 |
+
# How to use
|
25 |
+
|
26 |
+
To be added
|
27 |
+
|
28 |
+
μΆκ°μμ
|
29 |
+
|
30 |
+
---
|
31 |
+
license: apache-2.0
|
32 |
+
---
|