Tim-05369 committed on
Commit d7fc545 · verified · 1 Parent(s): 06d376e

Tim-05369/orpo_train_using_space_demo

Files changed (3)
  1. README.md +13 -15
  2. model.safetensors +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -1,6 +1,4 @@
  ---
- license: apache-2.0
- base_model: TinyLlama/TinyLlama-1.1B-Chat-v1.0
  tags:
  - trl
  - orpo
@@ -15,20 +13,20 @@ should probably proofread and complete it, then remove this comment. -->
 
  # results
 
- This model is a fine-tuned version of [TinyLlama/TinyLlama-1.1B-Chat-v1.0](https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v1.0) on the None dataset.
+ This model was trained from scratch on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 2.4477
- - Rewards/chosen: -0.1131
- - Rewards/rejected: -0.0946
- - Rewards/accuracies: 0.6667
- - Rewards/margins: -0.0185
- - Logps/rejected: -1.8929
- - Logps/chosen: -2.2621
- - Logits/rejected: -2.5533
- - Logits/chosen: -2.5157
- - Nll Loss: 2.3980
- - Log Odds Ratio: -0.9931
- - Log Odds Chosen: -0.3878
+ - Loss: 1.1578
+ - Rewards/chosen: -0.0609
+ - Rewards/rejected: -0.0483
+ - Rewards/accuracies: 0.5
+ - Rewards/margins: -0.0126
+ - Logps/rejected: -0.9658
+ - Logps/chosen: -1.2181
+ - Logits/rejected: -1.9100
+ - Logits/chosen: -2.3033
+ - Nll Loss: 1.1151
+ - Log Odds Ratio: -0.8532
+ - Log Odds Chosen: -0.2694
 
  ## Model description
 
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:d2a84ef621a9eedfa7f6f2b9b19ae685a5a721a37f0322c2134504785dacdc2c
+ oid sha256:271c150024b462dd3bc9cc9cba8917f9d264ba0e193ecedc0e1037c4a499561d
  size 4400216536
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:80166eae19da31ec096daf9e9ce36d5183712c11eda667881be34bf3390581aa
+ oid sha256:75150a9bfbb937ba6d1d897eff1824aebce6c5767c42e4ad7db8f81cc1dbb1b3
  size 5496
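Reviewer note: model.safetensors and training_args.bin appear in this diff only as Git LFS pointer files, so the change is visible as a new `oid` with an unchanged `size`. A downloaded copy can be checked against such a pointer with a small script. This is a minimal sketch using only the Python standard library; the function names `parse_lfs_pointer` and `matches_pointer` are hypothetical helpers, not part of any Git LFS or Hugging Face tooling.

```python
import hashlib

# Pointer for the new model.safetensors, copied from the "+" side of the diff.
NEW_MODEL_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:271c150024b462dd3bc9cc9cba8917f9d264ba0e193ecedc0e1037c4a499561d
size 4400216536
"""


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = dict(
        line.strip().split(" ", 1)
        for line in text.strip().splitlines()
        if line.strip()
    )
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the size and hash the pointer records."""
    hasher = hashlib.new(pointer["algo"])
    size = 0
    with open(path, "rb") as f:
        # Read in chunks so multi-gigabyte weights do not need to fit in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


if __name__ == "__main__":
    pointer = parse_lfs_pointer(NEW_MODEL_POINTER)
    print(pointer["algo"], pointer["size"])
    # Assumes a local copy named model.safetensors exists in the working directory:
    # print(matches_pointer("model.safetensors", pointer))
```

Because the size is identical on both sides (4400216536 bytes for the weights, 5496 bytes for the training arguments), only the hash distinguishes the old and new files.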