license: apache-2.0
datasets:
- brian-lim/smile_style_orca
language:
- ko
Korean Style Transfer
This model is a fine-tuned version of Synatra-7B-v0.3-dpo using a Korean style dataset provided by Smilegate AI (https://github.com/smilegate-ai/korean_smile_style_dataset/tree/main). Since the original dataset is tabular and not fit for training the LLM, I have preprocessed it into instruction-input-output format, which can be found (here)[https://huggingface.co/datasets/brian-lim/smile_style_orca]. The dataset is then fed into the ChatML template. Feel free to use my version of the dataset as needed.
ν΄λΉ λͺ¨λΈμ Synatra-7B-v0.3-dpo λͺ¨λΈμ μ€λ§μΌκ²μ΄νΈ AIμμ μ 곡νλ Smile style λ°μ΄ν°μ μΌλ‘ νμΈνλ νμ΅λλ€. κΈ°μ‘΄ λ°μ΄ν°μ μ ν μ΄λΈ ννλ‘ λμ΄μμ΄ ν΄λΉ λ°μ΄ν°λ₯Ό instruction-input-output ννλ‘ λ§λ€μκ³ , (μ¬κΈ°)[https://huggingface.co/datasets/brian-lim/smile_style_orca]μμ νμΈ κ°λ₯ν©λλ€. λ°μ΄ν°μ μ λΆλ¬μ¨ λ€ ChatML νμμ λ§μΆ° νλ ¨ λ°μ΄ν° ꡬμΆμ ν λ€ μ§ννμ΅λλ€. νμνμλ€λ©΄ μμ λ‘κ² μ¬μ©νμκΈ° λ°λλλ€.
Intended use & limitations
To be added
μΆκ° μμ
How to use
To be added
μΆκ°μμ