**License** cc-by-nc-2.0
# GAI-LLM/ko-en-llama2-13b-mixed-v3

## Model Details

**Model Developers** Donghoon Oh, Hanmin Myung, Eunyoung Kim (SK C&C G.AI Eng)

**Input** Models input text only.

**Output** Models generate text only.

**Model Architecture**

ko-en-llama2-13b-mixed-v3 is an auto-regressive language model based on the LLaMA2 transformer architecture.

**Base Model** hyunseoki/ko-en-llama2-13b
## Training Dataset

- We combined open Korean datasets using a mixed strategy (a sketch of the approach follows this list).
- KOpen-platypus + kaist_cot_deepL
- Training used 8 × A100 80GB GPUs.
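
The card does not include the mixing code itself; the following is a minimal sketch of one way such a mix could be built with the Hugging Face `datasets` library. The repository ids `kyujinpy/KOpen-platypus` and `kaist_cot_deepL_dataset` are assumptions for illustration, as is the concatenate-and-shuffle strategy.

```python
# Hypothetical sketch of a mixed-strategy dataset build (not the authors' code).
from datasets import load_dataset, concatenate_datasets

# Assumed repository ids; substitute the actual dataset locations.
platypus = load_dataset("kyujinpy/KOpen-platypus", split="train")
kaist_cot = load_dataset("kaist_cot_deepL_dataset", split="train")

# Assuming both datasets share the same instruction/output schema,
# concatenate them and shuffle so the two sources interleave during training.
mixed = concatenate_datasets([platypus, kaist_cot]).shuffle(seed=42)
print(f"{len(mixed)} mixed training examples")
```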
## Model Benchmark

**KO-LLM leaderboard**

- Results are reported on the Open KO-LLM LeaderBoard.

## Implementation Code
```python
### GAI-LLM/ko-en-llama2-13b-mixed-v3
from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

repo = "GAI-LLM/ko-en-llama2-13b-mixed-v3"

# Load the weights in fp16 and let device_map="auto" spread layers
# across the available GPUs/CPU.
model = AutoModelForCausalLM.from_pretrained(
    repo,
    return_dict=True,
    torch_dtype=torch.float16,
    device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained(repo)
```
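
The card stops after loading the model. For completeness, here is a short generation example continuing from the code above; the Korean prompt and the sampling parameters are illustrative choices, not taken from the card.

```python
# Illustrative usage, continuing from the loading code above.
prompt = "한국의 수도는 어디인가요?"  # "What is the capital of Korea?"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

outputs = model.generate(
    **inputs,
    max_new_tokens=128,
    do_sample=True,
    temperature=0.7,
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```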