nicholasKluge
commited on
Commit
·
bdf2d2c
1
Parent(s):
7e2fbd0
Upload AIRA_FineTuning.ipynb
Browse files- AIRA_FineTuning.ipynb +6 -6
AIRA_FineTuning.ipynb
CHANGED
@@ -783,7 +783,7 @@
|
|
783 |
"id": "vSqKuRjIe8ru"
|
784 |
},
|
785 |
"source": [
|
786 |
-
"
|
787 |
]
|
788 |
},
|
789 |
{
|
@@ -809,7 +809,7 @@
|
|
809 |
"id": "0vxvcTIHe8rv"
|
810 |
},
|
811 |
"source": [
|
812 |
-
"
|
813 |
]
|
814 |
},
|
815 |
{
|
@@ -939,7 +939,7 @@
|
|
939 |
"id": "XAoXD7GVYMN_"
|
940 |
},
|
941 |
"source": [
|
942 |
-
"
|
943 |
]
|
944 |
},
|
945 |
{
|
@@ -998,7 +998,7 @@
|
|
998 |
"id": "GJ1kH3uSe8rw"
|
999 |
},
|
1000 |
"source": [
|
1001 |
-
"
|
1002 |
]
|
1003 |
},
|
1004 |
{
|
@@ -1037,7 +1037,7 @@
|
|
1037 |
"id": "nz1a9lxXe8rw"
|
1038 |
},
|
1039 |
"source": [
|
1040 |
-
"
|
1041 |
]
|
1042 |
},
|
1043 |
{
|
@@ -1635,7 +1635,7 @@
|
|
1635 |
"id": "h48iOsqie8rx"
|
1636 |
},
|
1637 |
"source": [
|
1638 |
-
"
|
1639 |
]
|
1640 |
},
|
1641 |
{
|
|
|
783 |
"id": "vSqKuRjIe8ru"
|
784 |
},
|
785 |
"source": [
|
786 |
+
"7. Create the `DataLoaders` and specify the `batch_size`."
|
787 |
]
|
788 |
},
|
789 |
{
|
|
|
809 |
"id": "0vxvcTIHe8rv"
|
810 |
},
|
811 |
"source": [
|
812 |
+
"8. Load the base model (`GPT2LMHeadModel`)."
|
813 |
]
|
814 |
},
|
815 |
{
|
|
|
939 |
"id": "XAoXD7GVYMN_"
|
940 |
},
|
941 |
"source": [
|
942 |
+
"9. Freeze some of the layers for constrained fine-tuning. This allows the model to retain some of its original capabilities after the tuning."
|
943 |
]
|
944 |
},
|
945 |
{
|
|
|
998 |
"id": "GJ1kH3uSe8rw"
|
999 |
},
|
1000 |
"source": [
|
1001 |
+
"10. Set the training hyperparameters."
|
1002 |
]
|
1003 |
},
|
1004 |
{
|
|
|
1037 |
"id": "nz1a9lxXe8rw"
|
1038 |
},
|
1039 |
"source": [
|
1040 |
+
"11. Training/Validation loop. Track the carbon emissions of your work by using `codecarbon`. 🌱"
|
1041 |
]
|
1042 |
},
|
1043 |
{
|
|
|
1635 |
"id": "h48iOsqie8rx"
|
1636 |
},
|
1637 |
"source": [
|
1638 |
+
"12. Check the training stats and plot the learning curves."
|
1639 |
]
|
1640 |
},
|
1641 |
{
|