hugo-albert
/

CodeLlama-7b-hf-finetuned-py-to-cpp

Text Generation

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Community

hugo-albert commited on Dec 9, 2024

Commit

e8b7340

·

verified ·

1 Parent(s): ac94926

Training complete

Files changed (3) hide show

README.md +6 -6
adapter_model.bin +1 -1
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [codellama/CodeLlama-7b-hf](https://huggingface.co/codellama/CodeLlama-7b-hf) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.4982
 ## Model description
@@ -38,8 +38,8 @@ The following hyperparameters were used during training:
 - train_batch_size: 2
 - eval_batch_size: 2
 - seed: 42
-- gradient_accumulation_steps: 16
-- total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 100
@@ -49,9 +49,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 0.98  | 33   | 1.3304          |
-| No log        | 1.99  | 67   | 0.8024          |
-| No log        | 2.93  | 99   | 0.4982          |
 ### Framework versions

 This model is a fine-tuned version of [codellama/CodeLlama-7b-hf](https://huggingface.co/codellama/CodeLlama-7b-hf) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3878
 ## Model description
 - train_batch_size: 2
 - eval_batch_size: 2
 - seed: 42
+- gradient_accumulation_steps: 8
+- total_train_batch_size: 16
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 100
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 0.99  | 67   | 0.8366          |
+| No log        | 2.0   | 135  | 0.4170          |
+| No log        | 2.98  | 201  | 0.3878          |
 ### Framework versions

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:991bfd1bde6d471de0ef46599f34adffb717f8b8c5ca74039d3072fdb3332e23
 size 33600906

 version https://git-lfs.github.com/spec/v1
+oid sha256:4114c71c4b4a9c09c18f2c4acc762b9eeb4f662728b83ad7945331c3e19f2913
 size 33600906

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a0afa51a00ec4f8cbf00766daf052b63380e78aea143a0133112687bc46ac057
 size 4536

 version https://git-lfs.github.com/spec/v1
+oid sha256:89f43e98e82b89a9356e2b6464e7cbe84077365993a8ac943dac900b96f8ae18
 size 4536