Samsul Rahmadani committed
Commit b50a7e8 · 1 Parent(s): 121b9d2

add readme

Files changed (2)
  1. .gitattributes +0 -1
  2. README.md +37 -3
.gitattributes CHANGED
@@ -40,4 +40,3 @@ generation_config.json filter=lfs diff=lfs merge=lfs -text
  special_tokens_map.json filter=lfs diff=lfs merge=lfs -text
  tokenizer.json filter=lfs diff=lfs merge=lfs -text
  tokenizer_config.json filter=lfs diff=lfs merge=lfs -text
- README.md filter=lfs diff=lfs merge=lfs -text
 
 
README.md CHANGED
@@ -1,3 +1,37 @@
- version https://git-lfs.github.com/spec/v1
- oid sha256:bec9f2cd81f7396a64ee474145117d81139906443a21480cbe4993c97cb499aa
- size 5238
+ ---
+ license: bigscience-bloom-rail-1.0
+ ---
+ # BahasaGPT-1 Fine-Tuning Documentation Summary
+
+ ## Introduction
+
+ This document provides an overview of BahasaGPT-1, a model fine-tuned for instruction-following tasks in the Indonesian language. The model is based on the Bloomz-7B-mt architecture and was fine-tuned on a dataset of over 70,000 Indonesian instructions.
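+
+ The snippet below is a minimal inference sketch, assuming the fine-tuned weights are hosted on the Hugging Face Hub; the repository id shown is a placeholder, not something stated in this document.
+
+ ```python
+ # Minimal inference sketch (hypothetical repo id; replace with the actual Hub path).
+ from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline
+
+ model_id = "path/to/BahasaGPT-1"  # placeholder repo id
+ tokenizer = AutoTokenizer.from_pretrained(model_id)
+ model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")  # 7B model: a GPU is recommended
+
+ generator = pipeline("text-generation", model=model, tokenizer=tokenizer)
+ # "Jelaskan apa itu pembelajaran mesin." = "Explain what machine learning is."
+ print(generator("Jelaskan apa itu pembelajaran mesin.", max_new_tokens=128)[0]["generated_text"])
+ ```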
+
+ ## Model Details
+
+ **Model Name:** BahasaGPT-1
+
+ **Model Source:** Bloomz-7B-mt
+
+ **Dataset for Fine-Tuning:** Over 70k Indonesian instructions (Indonesia Instruct Dataset), generated using the Alpaca method from the following sources (see the illustrative record sketched after this list):
+
+ - [Stanford Alpaca](https://github.com/tatsu-lab/stanford_alpaca)
+ - Translated instructions from OA ([Anh/data at main · LAION-AI/Anh](https://github.com/LAION-AI/Anh))
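+
+ The document does not show the dataset schema itself; the sketch below assumes records follow the Stanford Alpaca instruction/input/output layout, and the Indonesian text is purely illustrative.
+
+ ```python
+ # Hypothetical example of one Alpaca-style training record (not taken from the actual dataset).
+ example_record = {
+     "instruction": "Terjemahkan kalimat berikut ke dalam bahasa Inggris.",  # "Translate the following sentence into English."
+     "input": "Saya sedang belajar pemrosesan bahasa alami.",                # "I am studying natural language processing."
+     "output": "I am studying natural language processing.",
+ }
+ ```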
+
+ ## Fine-Tuning Process
+
+ The BahasaGPT-1 model was fine-tuned on a dataset of over 70,000 Indonesian instructions, generated using the Stanford Alpaca method and combined with translated instructions from OA. This combination of datasets allowed the model to be better adapted to Indonesian-language instruction tasks.
+
+ During fine-tuning, the model's parameters were updated iteratively on this instruction dataset to optimize its performance on Indonesian-language tasks.
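+
+ The exact training setup (hardware, optimizer, hyperparameters, prompt template) is not described in this document. The snippet below is only a rough sketch of supervised fine-tuning on instruction/response pairs with the Hugging Face Trainer; the base checkpoint id, dataset path, prompt template, and hyperparameters are assumptions.
+
+ ```python
+ # Rough supervised fine-tuning sketch (assumed hyperparameters and placeholder paths).
+ from datasets import load_dataset
+ from transformers import (AutoModelForCausalLM, AutoTokenizer,
+                           DataCollatorForLanguageModeling, Trainer, TrainingArguments)
+
+ base_model = "bigscience/bloomz-7b1-mt"  # Bloomz-7B-mt base named in this card (Hub id assumed)
+ tokenizer = AutoTokenizer.from_pretrained(base_model)
+ model = AutoModelForCausalLM.from_pretrained(base_model)
+
+ dataset = load_dataset("json", data_files="indonesian_instructions.json")["train"]  # placeholder file
+
+ def to_text(example):
+     # Placeholder prompt template: instruction followed by the expected response.
+     return {"text": f"Instruksi: {example['instruction']}\nJawaban: {example['output']}"}
+
+ def tokenize(example):
+     return tokenizer(example["text"], truncation=True, max_length=1024)
+
+ tokenized = dataset.map(to_text).map(tokenize, remove_columns=dataset.column_names + ["text"])
+
+ trainer = Trainer(
+     model=model,
+     args=TrainingArguments(output_dir="bahasagpt-1", per_device_train_batch_size=4,
+                            gradient_accumulation_steps=8, num_train_epochs=3,
+                            learning_rate=2e-5, bf16=True, logging_steps=50),
+     train_dataset=tokenized,
+     data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
+ )
+ trainer.train()
+ ```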
+
+ ## Known Limitations
+
+ Despite the successful fine-tuning, the BahasaGPT-1 model still has some limitations:
+
+ 1. **Hallucination:** The model sometimes generates outputs that seem plausible but are not grounded in the input, which can lead to incorrect or nonsensical responses.
+
+ 2. **Repeated Tokens:** The model occasionally produces repeated tokens in the output, which may affect the overall coherence and readability of the generated text. Generation-time settings such as a repetition penalty can partially mitigate this; see the sketch below.
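+
+ As a mitigation sketch, standard `generate()` decoding options (not settings prescribed by this document) can discourage repetition, reusing the `tokenizer` and `model` loaded in the inference sketch under Introduction:
+
+ ```python
+ # Illustrative decoding settings to reduce repeated tokens (values are assumptions; tune per use case).
+ # "Jelaskan apa itu kecerdasan buatan." = "Explain what artificial intelligence is."
+ inputs = tokenizer("Jelaskan apa itu kecerdasan buatan.", return_tensors="pt").to(model.device)
+ output_ids = model.generate(
+     **inputs,
+     max_new_tokens=256,
+     do_sample=True,
+     temperature=0.7,
+     top_p=0.9,
+     repetition_penalty=1.2,   # penalize tokens that were already generated
+     no_repeat_ngram_size=3,   # forbid repeating any 3-gram
+ )
+ print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
+ ```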
+
+ ## Conclusion
+
+ The BahasaGPT-1 model is a fine-tuned language model for Indonesian language tasks, based on the Bloomz-7B-mt architecture. The model was trained on a dataset of over 70,000 Indonesian instructions generated using the Alpaca method and translated instructions from OA. Despite some limitations, such as occasional hallucination and repeated tokens, the model provides a valuable tool for working with Indonesian language tasks.