Trendyol
/

Trendyol-LLM-7b-chat-dpo-v1.0

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

yusufcakmak commited on Mar 5, 2024

Commit

98bc16f

·

verified ·

1 Parent(s): 026bfc3

Fixed typo

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -20,7 +20,7 @@ Trendyol LLM v1.0 - DPO is a generative model that is based on Mistral 7B model.
 **Output** Models generate text only.
-**Model Architecture** Trendyol LLM is an auto-regressive language model (based on Mistral 7b) that uses an optimized transformer architecture. The chat version is fine-tuned on 11K sets (prompt-chosen-reject) with the following trainables by using LoRA:
 - **lr**=5e-6
 - **lora_rank**=64

 **Output** Models generate text only.
+**Model Architecture** Trendyol LLM is an auto-regressive language model (based on Mistral 7b) that uses an optimized transformer architecture. The DPO version is fine-tuned on 11K sets (prompt-chosen-reject) with the following trainables by using LoRA:
 - **lr**=5e-6
 - **lora_rank**=64