---
# For reference on model card metadata, see the spec: https://github.com/huggingface/hub-docs/blob/main/modelcard.md?plain=1
# Doc / guide: https://huggingface.co/docs/hub/model-cards
{}
---

# Model Card for OpenThaiGPT 13b

<!-- Provide a quick summary of what the model is/does. -->

OpenThaiGPT 13b is a Thai-language question-answering chat model.

The prompt format follows the Llama 2 chat template:
```
<s>[INST] <<SYS>>
system_prompt
<</SYS>>

question [/INST]
```

System prompt (the Thai sentence restates the English instruction):

You are a question answering assistant. Answer the question as truthful and helpful as possible คุณคือผู้ช่วยตอบคำถาม จงตอบคำถามอย่างถูกต้องและมีประโยชน์ที่สุด
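
For example, combining the template with this system prompt and the question from the cURL request below (อยากลดความอ้วนต้องทำอย่างไร, "How do I lose weight?") gives the full prompt string:

```
<s>[INST] <<SYS>>
You are a question answering assistant. Answer the question as truthful and helpful as possible คุณคือผู้ช่วยตอบคำถาม จงตอบคำถามอย่างถูกต้องและมีประโยชน์ที่สุด
<</SYS>>

อยากลดความอ้วนต้องทำอย่างไร [/INST]
```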

## How to use

1. Install vLLM (https://github.com/vllm-project/vllm).
2. Start the API server (see the setup sketch after this list): `python -m vllm.entrypoints.api_server --model /path/to/model --tensor-parallel-size num_gpus`
3. Run inference (cURL example below).
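
A minimal setup sketch for steps 1 and 2, assuming a pip installation of vLLM; the model path and GPU count are placeholders to adjust for your machine:

```
# 1. Install vLLM (requires a CUDA-capable GPU environment)
pip install vllm

# 2. Launch the vLLM API server; replace the model path and GPU count with your own values
python -m vllm.entrypoints.api_server \
  --model /path/to/model \
  --tensor-parallel-size 2
```

Once the server is running, send a request as in the cURL example below (step 3).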

```
curl --request POST \
  --url http://localhost:8000/generate \
  --header "Content-Type: application/json" \
  --data '{"prompt": "<s>[INST] <<SYS>>\nYou are a question answering assistant. Answer the question as truthful and helpful as possible คุณคือผู้ช่วยตอบคำถาม จงตอบคำถามอย่างถูกต้องและมีประโยชน์ที่สุด\n<</SYS>>\n\nอยากลดความอ้วนต้องทำอย่างไร [/INST]", "use_beam_search": false, "temperature": 0.1, "max_tokens": 512, "top_p": 0.75, "top_k": 40, "frequency_penalty": 0.3, "stop": "</s>"}'
```