HelpMum-Personal commited on
Commit
fdd84f8
·
verified ·
1 Parent(s): e8c2d10

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -16
README.md CHANGED
@@ -95,20 +95,16 @@ The training data consists of a diverse set of questions and answers related to
95
 
96
  The model was fine-tuned on the vaccination dataset using the following hyperparameters:
97
 
98
- - **Training regime:** Mixed precision (fp16)
99
- - **Batch Size:** 32
100
- - **Learning Rate:** 2e-5
101
- - **Epochs:** 5
 
102
 
103
  #### Preprocessing
104
 
105
  The data was cleaned and tokenized to ensure high-quality input for the model training process.
106
 
107
- #### Speeds, Sizes, Times
108
-
109
- - **Training Time:** Approximately 72 hours
110
- - **Checkpoint Size:** 8GB
111
-
112
  ## Evaluation
113
 
114
  ### Testing Data, Factors & Metrics
@@ -123,10 +119,10 @@ The evaluation considered various factors, including the accuracy and relevance
123
 
124
  #### Metrics
125
 
126
- - **Accuracy:** 92%
127
- - **Response Relevance:** 90%
128
- - **Average Latency:** 200ms
129
- - **Max Tokens per Response:** 150
130
 
131
  ### Results
132
 
@@ -148,9 +144,6 @@ The Vax-Llama-1 is a transformer-based language model built on the Llama3 archit
148
 
149
  ### Compute Infrastructure
150
 
151
- #### Hardware
152
-
153
- - **GPUs:** NVIDIA A100
154
 
155
  #### Software
156
 
 
95
 
96
  The model was fine-tuned on the vaccination dataset using the following hyperparameters:
97
 
98
+ - **Fine-Tuning Epochs:** 3
99
+ - **Batch Size:** 1 (per device for training and evaluation)
100
+ - **Learning Rate:** 2e-4
101
+ - **Max Tokens per Response:** 512
102
+
103
 
104
  #### Preprocessing
105
 
106
  The data was cleaned and tokenized to ensure high-quality input for the model training process.
107
 
 
 
 
 
 
108
  ## Evaluation
109
 
110
  ### Testing Data, Factors & Metrics
 
119
 
120
  #### Metrics
121
 
122
+ - **Loss:** 0.3554
123
+ - **Runtime:** 195.8647 seconds
124
+ - **Samples per Second:** 0.735
125
+
126
 
127
  ### Results
128
 
 
144
 
145
  ### Compute Infrastructure
146
 
 
 
 
147
 
148
  #### Software
149