gardner commited on
Commit
453a20a
·
verified ·
1 Parent(s): 876a872

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +15 -17
README.md CHANGED
@@ -6,10 +6,22 @@ tags:
6
  model-index:
7
  - name: TinyLlama-1.1B-DPO-Function-Calling-3T
8
  results: []
 
 
 
 
9
  ---
10
 
11
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
12
- should probably proofread and complete it, then remove this comment. -->
 
 
 
 
 
 
 
 
13
 
14
  [<img src="https://raw.githubusercontent.com/OpenAccess-AI-Collective/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/OpenAccess-AI-Collective/axolotl)
15
  <details><summary>See axolotl config</summary>
@@ -113,20 +125,6 @@ dataloader_pin_memory: true
113
 
114
  This model is a fine-tuned version of [gardner/TinyLlama-1.1B-SlimOrca-Function-Calling-3T](https://huggingface.co/gardner/TinyLlama-1.1B-SlimOrca-Function-Calling-3T) on the None dataset.
115
 
116
- ## Model description
117
-
118
- More information needed
119
-
120
- ## Intended uses & limitations
121
-
122
- More information needed
123
-
124
- ## Training and evaluation data
125
-
126
- More information needed
127
-
128
- ## Training procedure
129
-
130
  ### Training hyperparameters
131
 
132
  The following hyperparameters were used during training:
@@ -148,4 +146,4 @@ The following hyperparameters were used during training:
148
  - Transformers 4.37.0
149
  - Pytorch 2.1.2+cu121
150
  - Datasets 2.16.1
151
- - Tokenizers 0.15.0
 
6
  model-index:
7
  - name: TinyLlama-1.1B-DPO-Function-Calling-3T
8
  results: []
9
+ datasets:
10
+ - argilla/distilabel-intel-orca-dpo-pairs
11
+ language:
12
+ - en
13
  ---
14
 
15
+ ## TinyLlama-1.1B-DPO-Function-Calling-3T
16
+
17
+
18
+ This model is a DPO fine tune of [gardner/TinyLlama-1.1B-SlimOrca-Function-Calling-3T](https://huggingface.co/datasets/gardner/TinyLlama-1.1B-SlimOrca-Function-Calling-3T) which itself was trained on:
19
+
20
+ 1. [Open-Orca/SlimOrca-Dedup](https://huggingface.co/datasets/Open-Orca/SlimOrca-Dedup)
21
+ 1. [gardner/glaive-function-calling-v2-sharegpt](https://huggingface.co/datasets/gardner/glaive-function-calling-v2-sharegpt)
22
+
23
+ The model scores unusually high on GSM8K which indicates the glaive function calling dataset may introduce data contamination.
24
+
25
 
26
  [<img src="https://raw.githubusercontent.com/OpenAccess-AI-Collective/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/OpenAccess-AI-Collective/axolotl)
27
  <details><summary>See axolotl config</summary>
 
125
 
126
  This model is a fine-tuned version of [gardner/TinyLlama-1.1B-SlimOrca-Function-Calling-3T](https://huggingface.co/gardner/TinyLlama-1.1B-SlimOrca-Function-Calling-3T) on the None dataset.
127
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
128
  ### Training hyperparameters
129
 
130
  The following hyperparameters were used during training:
 
146
  - Transformers 4.37.0
147
  - Pytorch 2.1.2+cu121
148
  - Datasets 2.16.1
149
+ - Tokenizers 0.15.0