hugo-albert commited on
Commit
e8b7340
·
verified ·
1 Parent(s): ac94926

Training complete

Browse files
Files changed (3) hide show
  1. README.md +6 -6
  2. adapter_model.bin +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [codellama/CodeLlama-7b-hf](https://huggingface.co/codellama/CodeLlama-7b-hf) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 0.4982
19
 
20
  ## Model description
21
 
@@ -38,8 +38,8 @@ The following hyperparameters were used during training:
38
  - train_batch_size: 2
39
  - eval_batch_size: 2
40
  - seed: 42
41
- - gradient_accumulation_steps: 16
42
- - total_train_batch_size: 32
43
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
44
  - lr_scheduler_type: linear
45
  - lr_scheduler_warmup_steps: 100
@@ -49,9 +49,9 @@ The following hyperparameters were used during training:
49
 
50
  | Training Loss | Epoch | Step | Validation Loss |
51
  |:-------------:|:-----:|:----:|:---------------:|
52
- | No log | 0.98 | 33 | 1.3304 |
53
- | No log | 1.99 | 67 | 0.8024 |
54
- | No log | 2.93 | 99 | 0.4982 |
55
 
56
 
57
  ### Framework versions
 
15
 
16
  This model is a fine-tuned version of [codellama/CodeLlama-7b-hf](https://huggingface.co/codellama/CodeLlama-7b-hf) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 0.3878
19
 
20
  ## Model description
21
 
 
38
  - train_batch_size: 2
39
  - eval_batch_size: 2
40
  - seed: 42
41
+ - gradient_accumulation_steps: 8
42
+ - total_train_batch_size: 16
43
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
44
  - lr_scheduler_type: linear
45
  - lr_scheduler_warmup_steps: 100
 
49
 
50
  | Training Loss | Epoch | Step | Validation Loss |
51
  |:-------------:|:-----:|:----:|:---------------:|
52
+ | No log | 0.99 | 67 | 0.8366 |
53
+ | No log | 2.0 | 135 | 0.4170 |
54
+ | No log | 2.98 | 201 | 0.3878 |
55
 
56
 
57
  ### Framework versions
adapter_model.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:991bfd1bde6d471de0ef46599f34adffb717f8b8c5ca74039d3072fdb3332e23
3
  size 33600906
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4114c71c4b4a9c09c18f2c4acc762b9eeb4f662728b83ad7945331c3e19f2913
3
  size 33600906
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a0afa51a00ec4f8cbf00766daf052b63380e78aea143a0133112687bc46ac057
3
  size 4536
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:89f43e98e82b89a9356e2b6464e7cbe84077365993a8ac943dac900b96f8ae18
3
  size 4536