HelpMum-Personal
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -95,20 +95,16 @@ The training data consists of a diverse set of questions and answers related to
|
|
95 |
|
96 |
The model was fine-tuned on the vaccination dataset using the following hyperparameters:
|
97 |
|
98 |
-
- **
|
99 |
-
- **Batch Size:**
|
100 |
-
- **Learning Rate:** 2e-
|
101 |
-
- **
|
|
|
102 |
|
103 |
#### Preprocessing
|
104 |
|
105 |
The data was cleaned and tokenized to ensure high-quality input for the model training process.
|
106 |
|
107 |
-
#### Speeds, Sizes, Times
|
108 |
-
|
109 |
-
- **Training Time:** Approximately 72 hours
|
110 |
-
- **Checkpoint Size:** 8GB
|
111 |
-
|
112 |
## Evaluation
|
113 |
|
114 |
### Testing Data, Factors & Metrics
|
@@ -123,10 +119,10 @@ The evaluation considered various factors, including the accuracy and relevance
|
|
123 |
|
124 |
#### Metrics
|
125 |
|
126 |
-
- **
|
127 |
-
- **
|
128 |
-
- **
|
129 |
-
|
130 |
|
131 |
### Results
|
132 |
|
@@ -148,9 +144,6 @@ The Vax-Llama-1 is a transformer-based language model built on the Llama3 archit
|
|
148 |
|
149 |
### Compute Infrastructure
|
150 |
|
151 |
-
#### Hardware
|
152 |
-
|
153 |
-
- **GPUs:** NVIDIA A100
|
154 |
|
155 |
#### Software
|
156 |
|
|
|
95 |
|
96 |
The model was fine-tuned on the vaccination dataset using the following hyperparameters:
|
97 |
|
98 |
+
- **Fine-Tuning Epochs:** 3
|
99 |
+
- **Batch Size:** 1 (per device for training and evaluation)
|
100 |
+
- **Learning Rate:** 2e-4
|
101 |
+
- **Max Tokens per Response:** 512
|
102 |
+
|
103 |
|
104 |
#### Preprocessing
|
105 |
|
106 |
The data was cleaned and tokenized to ensure high-quality input for the model training process.
|
107 |
|
|
|
|
|
|
|
|
|
|
|
108 |
## Evaluation
|
109 |
|
110 |
### Testing Data, Factors & Metrics
|
|
|
119 |
|
120 |
#### Metrics
|
121 |
|
122 |
+
- **Loss:** 0.3554
|
123 |
+
- **Runtime:** 195.8647 seconds
|
124 |
+
- **Samples per Second:** 0.735
|
125 |
+
|
126 |
|
127 |
### Results
|
128 |
|
|
|
144 |
|
145 |
### Compute Infrastructure
|
146 |
|
|
|
|
|
|
|
147 |
|
148 |
#### Software
|
149 |
|