Tim-05369 committed on
Commit 89d3223 · verified · 1 Parent(s): 292f00e

Tim-05369/orpo_train_using_space_demo

Files changed (3)
  1. README.md +15 -13
  2. model.safetensors +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -1,4 +1,6 @@
  ---
+ license: apache-2.0
+ base_model: TinyLlama/TinyLlama-1.1B-Chat-v1.0
  tags:
  - trl
  - orpo
@@ -13,20 +15,20 @@ should probably proofread and complete it, then remove this comment. -->
 
  # results
 
- This model was trained from scratch on the None dataset.
+ This model is a fine-tuned version of [TinyLlama/TinyLlama-1.1B-Chat-v1.0](https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v1.0) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 2.7473
- - Rewards/chosen: -0.1208
- - Rewards/rejected: -0.0591
- - Rewards/accuracies: 0.0
- - Rewards/margins: -0.0617
- - Logps/rejected: -1.1825
- - Logps/chosen: -2.4166
- - Logits/rejected: -2.2554
- - Logits/chosen: -2.5517
- - Nll Loss: 2.6563
- - Log Odds Ratio: -1.8199
- - Log Odds Chosen: -1.6127
+ - Loss: 1.8073
+ - Rewards/chosen: -0.0719
+ - Rewards/rejected: -0.1594
+ - Rewards/accuracies: 0.6667
+ - Rewards/margins: 0.0874
+ - Logps/rejected: -3.1874
+ - Logps/chosen: -1.4385
+ - Logits/rejected: -2.2859
+ - Logits/chosen: -2.4167
+ - Nll Loss: 1.7887
+ - Log Odds Ratio: -0.3720
+ - Log Odds Chosen: 2.6873
 
  ## Model description
 
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:33b44e9ee80b7b4d6082302c08b749ded15097de50d89e365d4e1ac0ec9c0ccc
+ oid sha256:74ddbbdceecb03ec6e89c6ae9a82ff27449b7190a035a4c92a99d41dc6a682c0
  size 4400216536
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:bc2971db7610c3103eb27b87f68da49638853ed4e2be3209e0dadd8d0d86162e
+ oid sha256:ccba72281dc9a290f89ffb1ade8e85412a27e156a73f32f98f07081e5a0b7c57
  size 5496
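
The evaluation metrics in the README.md diff above appear to be related to each other by simple arithmetic, which gives a quick sanity check on the reported numbers. The sketch below assumes an ORPO-style decomposition: margin = chosen reward minus rejected reward, reward = beta × log-probability, and loss = NLL loss minus beta × log odds ratio. The value `BETA = 0.05` is not stated anywhere in this commit; it is inferred from the ratio of Rewards/chosen to Logps/chosen and should be treated as an assumption, as should the helper names `residuals` and `is_consistent`.

```python
# Metrics copied from the README.md diff (before and after this commit).
OLD = {
    "loss": 2.7473, "nll_loss": 2.6563, "log_odds_ratio": -1.8199,
    "rewards_chosen": -0.1208, "rewards_rejected": -0.0591,
    "rewards_margins": -0.0617,
    "logps_chosen": -2.4166, "logps_rejected": -1.1825,
}
NEW = {
    "loss": 1.8073, "nll_loss": 1.7887, "log_odds_ratio": -0.3720,
    "rewards_chosen": -0.0719, "rewards_rejected": -0.1594,
    "rewards_margins": 0.0874,
    "logps_chosen": -1.4385, "logps_rejected": -3.1874,
}

# ASSUMPTION: beta is not given in the commit. 0.05 is inferred from
# rewards/chosen being roughly 0.05 * logps/chosen in both versions.
BETA = 0.05


def residuals(m, beta=BETA):
    """Return reported value minus recomputed value for each assumed relation."""
    return {
        "margin": m["rewards_margins"]
        - (m["rewards_chosen"] - m["rewards_rejected"]),
        "reward_chosen": m["rewards_chosen"] - beta * m["logps_chosen"],
        "reward_rejected": m["rewards_rejected"] - beta * m["logps_rejected"],
        "loss": m["loss"] - (m["nll_loss"] - beta * m["log_odds_ratio"]),
    }


def is_consistent(m, beta=BETA, tol=2e-4):
    """True if every residual is within the rounding tolerance (4 decimals)."""
    return all(abs(r) <= tol for r in residuals(m, beta).values())


if __name__ == "__main__":
    for name, metrics in (("before", OLD), ("after", NEW)):
        print(name, is_consistent(metrics), residuals(metrics))
```

The tolerance of `2e-4` allows for the metrics being rounded to four decimal places before they were written to the README. The Log Odds Chosen value is left out of the check because it is an average over per-example quantities and cannot be recomputed from the averaged log-probabilities alone.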