martimfasantos commited on
Commit
67b0f07
·
verified ·
1 Parent(s): 7a1aacb

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -5
README.md CHANGED
@@ -1,20 +1,23 @@
1
  ---
2
  license: gemma
3
- base_model: martimfasantos/sft-sum-chosen-10lp-shuff-full-gemma
4
  tags:
5
  - summarization
6
  - generated_from_trainer
7
  model-index:
8
- - name: dpo-sum-10p-shuff-1e-7-full-from-sft-gemma
9
  results: []
 
 
 
10
  ---
11
 
12
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
13
  should probably proofread and complete it, then remove this comment. -->
14
 
15
- # dpo-sum-10p-shuff-1e-7-full-from-sft-gemma
16
 
17
- This model is a fine-tuned version of [martimfasantos/sft-sum-chosen-10lp-shuff-full-gemma](https://huggingface.co/martimfasantos/sft-sum-chosen-10lp-shuff-full-gemma) on the openai/summarize_from_feedback dataset.
18
 
19
  ## Model description
20
 
@@ -56,4 +59,4 @@ The following hyperparameters were used during training:
56
  - Transformers 4.43.3
57
  - Pytorch 2.3.1+cu121
58
  - Datasets 2.20.0
59
- - Tokenizers 0.19.1
 
1
  ---
2
  license: gemma
3
+ base_model: martimfasantos/gemma-2-2b-Sum-SFT
4
  tags:
5
  - summarization
6
  - generated_from_trainer
7
  model-index:
8
+ - name: gemma-2-2b-Sum-DPO
9
  results: []
10
+ datasets:
11
+ - openai/summarize_from_feedback
12
+ pipeline_tag: summarization
13
  ---
14
 
15
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
16
  should probably proofread and complete it, then remove this comment. -->
17
 
18
+ # gemma-2-2b-Sum-DPO
19
 
20
+ This model is a fine-tuned version of [martimfasantos/gemma-2-2b-Sum-SFT](https://huggingface.co/martimfasantos/gemma-2-2b-Sum-SFT) on the openai/summarize_from_feedback dataset.
21
 
22
  ## Model description
23
 
 
59
  - Transformers 4.43.3
60
  - Pytorch 2.3.1+cu121
61
  - Datasets 2.20.0
62
+ - Tokenizers 0.19.1