brishtiteveja
committed
End of training
- README.md +14 -1
- adapter_model.bin +3 -0
README.md
CHANGED
@@ -110,7 +110,9 @@ special_tokens:
 [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/banglallm/banglallm-training/runs/balglallm-training-llama3.2-1b-finetuning-5Oct2024_10_00_AM-id-1)
 # BanglaLLama-3.2-1b-bangla-alpaca-orca-instruct-v0.0.1
 
-This model is a fine-tuned version of [BanglaLLM/BanglaLLama-3.2-1b-unolp-culturax-base-v0.0.1](https://huggingface.co/BanglaLLM/BanglaLLama-3.2-1b-unolp-culturax-base-v0.0.1) on
+This model is a fine-tuned version of [BanglaLLM/BanglaLLama-3.2-1b-unolp-culturax-base-v0.0.1](https://huggingface.co/BanglaLLM/BanglaLLama-3.2-1b-unolp-culturax-base-v0.0.1) on the None dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.4511
 
 ## Model description
 
@@ -140,6 +142,17 @@ The following hyperparameters were used during training:
 - lr_scheduler_warmup_steps: 10
 - num_epochs: 1
 
+### Training results
+
+| Training Loss | Epoch  | Step | Validation Loss |
+|:-------------:|:------:|:----:|:---------------:|
+| 0.8315        | 0.0002 | 1    | 0.9496          |
+| 0.4253        | 0.2501 | 1424 | 0.5142          |
+| 0.487         | 0.5003 | 2848 | 0.4754          |
+| 0.4383        | 0.7504 | 4272 | 0.4550          |
+| 0.478         | 1.0006 | 5696 | 0.4511          |
+
+
 ### Framework versions
 
 - PEFT 0.12.0
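The updated card describes a PEFT adapter (PEFT 0.12.0) fine-tuned from BanglaLLM/BanglaLLama-3.2-1b-unolp-culturax-base-v0.0.1, and this commit adds the adapter weights themselves (adapter_model.bin). A minimal loading sketch follows; the adapter repo id is an assumption taken from the card title, and the prompt is purely illustrative since the card does not state a prompt template.

```python
# Minimal sketch: attach the fine-tuned PEFT adapter to its base model.
# Assumption: the adapter lives at a repo id mirroring the model-card title;
# adjust it to the actual Hugging Face repo this commit belongs to.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_id = "BanglaLLM/BanglaLLama-3.2-1b-unolp-culturax-base-v0.0.1"               # from the card
adapter_id = "BanglaLLM/BanglaLLama-3.2-1b-bangla-alpaca-orca-instruct-v0.0.1"    # assumed repo id

tokenizer = AutoTokenizer.from_pretrained(base_id)
base_model = AutoModelForCausalLM.from_pretrained(base_id)

# Load the adapter weights added in this commit (adapter_model.bin) on top of the base model.
model = PeftModel.from_pretrained(base_model, adapter_id)

# Illustrative prompt only; the card does not specify the instruction template.
prompt = "প্রশ্ন: বাংলাদেশের রাজধানী কোথায়?\nউত্তর:"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=50)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

For reference, the final validation loss of 0.4511 corresponds to a perplexity of roughly exp(0.4511) ≈ 1.57, assuming the reported loss is the mean token-level cross-entropy in nats.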
adapter_model.bin
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:1f95a44ce90cdd17173f748d0a8d1c261d758aceb7fd5e52075407bfdd15771d
+size 1140932242
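The new adapter_model.bin is committed as a Git LFS pointer, so the repository only stores the object id and size above; the roughly 1.1 GB payload is fetched separately (for example via `git lfs pull`). A small sketch of checking a fetched copy against the pointer metadata, assuming the file has already been downloaded into the working directory:

```python
# Sketch: verify a Git LFS-tracked file against its pointer metadata.
# The oid and size are copied from the pointer added in this commit.
import hashlib
import os

EXPECTED_OID = "1f95a44ce90cdd17173f748d0a8d1c261d758aceb7fd5e52075407bfdd15771d"
EXPECTED_SIZE = 1140932242  # bytes, ~1.06 GiB

path = "adapter_model.bin"  # assumes the real file, not the pointer, is present locally
assert os.path.getsize(path) == EXPECTED_SIZE, "size mismatch"

sha256 = hashlib.sha256()
with open(path, "rb") as f:
    # Hash in 1 MiB chunks to avoid loading the whole file into memory.
    for chunk in iter(lambda: f.read(1 << 20), b""):
        sha256.update(chunk)

assert sha256.hexdigest() == EXPECTED_OID, "sha256 mismatch"
print("adapter_model.bin matches its LFS pointer")
```

This mirrors the check Git LFS performs itself: for the default sha256 object type, the oid is simply the SHA-256 hash of the file contents.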