Sentiment-google-t5-v1_1-large-inter_model-frequency-human_annots_str

This model is a fine-tuned version of google/t5-v1_1-large on the None dataset. It achieves the following results on the evaluation set:

Loss: 1.2676

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 0.0001
train_batch_size: 128
eval_batch_size: 128
seed: 42
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: linear
num_epochs: 200

Training results

Training Loss	Epoch	Step	Validation Loss
20.8675	1.0	44	24.9691
18.2491	2.0	88	16.9536
13.3557	3.0	132	11.4063
11.0809	4.0	176	10.9529
10.1527	5.0	220	10.7258
9.9235	6.0	264	10.5299
9.8499	7.0	308	10.3942
9.5638	8.0	352	10.0279
9.0218	9.0	396	9.4463
8.5704	10.0	440	9.0452
8.3982	11.0	484	8.8619
8.3396	12.0	528	8.7363
7.0951	13.0	572	1.3757
1.1349	14.0	616	1.0516
1.0735	15.0	660	1.0456
1.0787	16.0	704	1.0342
1.0796	17.0	748	1.0342
1.0476	18.0	792	1.0318
1.0566	19.0	836	1.0311
1.0472	20.0	880	1.0247
1.0401	21.0	924	1.0201
1.0725	22.0	968	1.0159
1.0447	23.0	1012	1.0180
1.0477	24.0	1056	1.0121
1.0357	25.0	1100	1.0109
1.0333	26.0	1144	1.0096
1.0282	27.0	1188	1.0078
1.0206	28.0	1232	1.0100
1.0241	29.0	1276	1.0081
1.023	30.0	1320	1.0053
0.9993	31.0	1364	1.0073
1.0104	32.0	1408	1.0079
1.0176	33.0	1452	1.0014
1.0157	34.0	1496	0.9977
1.0204	35.0	1540	0.9960
1.0174	36.0	1584	0.9967
1.0252	37.0	1628	0.9949
1.0076	38.0	1672	0.9913
1.0137	39.0	1716	0.9874
1.0151	40.0	1760	0.9856
0.9907	41.0	1804	0.9843
1.0147	42.0	1848	0.9803
1.001	43.0	1892	0.9777
1.0009	44.0	1936	0.9735
0.9881	45.0	1980	0.9731
0.9973	46.0	2024	0.9761
0.9982	47.0	2068	0.9888
0.9826	48.0	2112	1.0006
0.9739	49.0	2156	0.9766
0.9659	50.0	2200	0.9525
0.9534	51.0	2244	0.9400
0.959	52.0	2288	0.9553
0.9492	53.0	2332	0.9308
0.9629	54.0	2376	0.9325
0.9532	55.0	2420	0.9288
0.9586	56.0	2464	0.9233
0.9511	57.0	2508	0.9228
0.9456	58.0	2552	0.9178
0.937	59.0	2596	0.9140
0.9415	60.0	2640	0.9332
0.9364	61.0	2684	0.9073
0.9304	62.0	2728	0.9112
0.9418	63.0	2772	0.9073
0.9423	64.0	2816	0.9079
0.9277	65.0	2860	0.9062
0.9274	66.0	2904	0.8999
0.9266	67.0	2948	0.8971
0.9231	68.0	2992	0.9003
0.9174	69.0	3036	0.8994
0.9036	70.0	3080	0.8986
0.9112	71.0	3124	0.8925
0.8929	72.0	3168	0.8866
0.9069	73.0	3212	0.8840
0.8922	74.0	3256	0.8818
0.9079	75.0	3300	0.8821
0.8941	76.0	3344	0.8780
0.8952	77.0	3388	0.8824
0.8881	78.0	3432	0.8724
0.884	79.0	3476	0.8684
0.8761	80.0	3520	0.8715
0.8952	81.0	3564	0.8706
0.8871	82.0	3608	0.8654
0.8772	83.0	3652	0.8583
0.8745	84.0	3696	0.8570
0.8683	85.0	3740	0.8490
0.8698	86.0	3784	0.8500
0.8562	87.0	3828	0.8469
0.8636	88.0	3872	0.8465
0.8669	89.0	3916	0.8359
0.8422	90.0	3960	0.8418
0.8568	91.0	4004	0.8332
0.8628	92.0	4048	0.8338
0.8599	93.0	4092	0.8302
0.8471	94.0	4136	0.8235
0.8432	95.0	4180	0.8202
0.8389	96.0	4224	0.8159
0.8347	97.0	4268	0.8218
0.8353	98.0	4312	0.8141
0.8172	99.0	4356	0.8176
0.8303	100.0	4400	0.8078
0.8317	101.0	4444	0.8077
0.8203	102.0	4488	0.8103
0.8224	103.0	4532	0.8076
0.8174	104.0	4576	0.8023
0.8242	105.0	4620	0.7897
0.809	106.0	4664	0.7935
0.8014	107.0	4708	0.7881
0.817	108.0	4752	0.7815
0.7988	109.0	4796	0.7861
0.8003	110.0	4840	0.7716
0.7991	111.0	4884	0.7836
0.7851	112.0	4928	0.7722
0.7884	113.0	4972	0.7716
0.7831	114.0	5016	0.7643
0.7849	115.0	5060	0.7767
0.7846	116.0	5104	0.7602
0.7887	117.0	5148	0.7511
0.7683	118.0	5192	0.7480
0.7856	119.0	5236	0.7532
0.766	120.0	5280	0.7511
0.7663	121.0	5324	0.7490
0.7456	122.0	5368	0.7460
0.7672	123.0	5412	0.7464
0.7553	124.0	5456	0.7324
0.7543	125.0	5500	0.7296
0.7465	126.0	5544	0.7431
0.7525	127.0	5588	0.7310
0.7438	128.0	5632	0.7333
0.7521	129.0	5676	0.7218
0.7501	130.0	5720	0.7170
0.7485	131.0	5764	0.7214
0.7512	132.0	5808	0.7235
0.7554	133.0	5852	0.7140
0.7349	134.0	5896	0.7062
0.7542	135.0	5940	0.7095
0.7303	136.0	5984	0.7111
0.7163	137.0	6028	0.7004
0.7204	138.0	6072	0.7045
0.7091	139.0	6116	0.6918
0.719	140.0	6160	0.6976
0.726	141.0	6204	0.6885
0.7079	142.0	6248	0.6896
0.7043	143.0	6292	0.6966
0.7078	144.0	6336	0.6833
0.711	145.0	6380	0.6839
0.7014	146.0	6424	0.6685
0.7026	147.0	6468	0.6752
0.6927	148.0	6512	0.6802
0.6899	149.0	6556	0.6747
0.7059	150.0	6600	0.6733
0.6855	151.0	6644	0.6551
0.694	152.0	6688	0.6590
0.6896	153.0	6732	0.6568
0.6758	154.0	6776	0.6595
0.7058	155.0	6820	0.6506
0.6761	156.0	6864	0.6586
0.6837	157.0	6908	0.6526
0.6736	158.0	6952	0.6526
0.6738	159.0	6996	0.6434
0.685	160.0	7040	0.6382
0.664	161.0	7084	0.6374
0.6878	162.0	7128	0.6322
0.6552	163.0	7172	0.6338
0.6796	164.0	7216	0.6453
0.6712	165.0	7260	0.6284
0.6683	166.0	7304	0.6249
0.6577	167.0	7348	0.6359
0.6462	168.0	7392	0.6193
0.66	169.0	7436	0.6138
0.6476	170.0	7480	0.6224
0.6444	171.0	7524	0.6195
0.6478	172.0	7568	0.6136
0.6332	173.0	7612	0.5981
0.6456	174.0	7656	0.6004
0.6302	175.0	7700	0.6060
0.6337	176.0	7744	0.6024
0.6282	177.0	7788	0.5936
0.616	178.0	7832	0.5942
0.6324	179.0	7876	0.6038
0.6331	180.0	7920	0.5939
0.627	181.0	7964	0.5881
0.6313	182.0	8008	0.5874
0.626	183.0	8052	0.5868
0.6215	184.0	8096	0.5789
0.6138	185.0	8140	0.5830
0.6235	186.0	8184	0.5900
0.61	187.0	8228	0.5920
0.6218	188.0	8272	0.5830
0.6265	189.0	8316	0.5706
0.6126	190.0	8360	0.5776
0.608	191.0	8404	0.5738
0.6143	192.0	8448	0.5737
0.6065	193.0	8492	0.5714
0.6213	194.0	8536	0.5657
0.6004	195.0	8580	0.5660
0.6229	196.0	8624	0.5646
0.6073	197.0	8668	0.5704
0.6048	198.0	8712	0.5696
0.6008	199.0	8756	0.5619
0.6157	200.0	8800	0.5597

Framework versions

Transformers 4.34.0
Pytorch 2.1.0+cu121
Datasets 2.14.5
Tokenizers 0.14.1

owanr
/

Sentiment-google-t5-v1_1-large-inter_model-frequency-human_annots_str

Sentiment-google-t5-v1_1-large-inter_model-frequency-human_annots_str

Model description

Intended uses & limitations

Training and evaluation data

Training procedure

Training hyperparameters

Training results

Framework versions

Model tree for owanr/Sentiment-google-t5-v1_1-large-inter_model-frequency-human_annots_str

Evaluation results