jy46604790 commited on
Commit
4ea03fa
·
1 Parent(s): eda4315

Upload Part4.md

Browse files
Files changed (1) hide show
  1. Part4.md +10 -0
Part4.md ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ ## Critical Analysis
2
+
3
+ 1. The current model can only take first 500 words in the text due to training resources. It might cause problems when a long text is entered into the model which context is highly relevant.
4
+
5
+ 2. When the input text is too short, it often judges it as fake news due to our strategy of truncating first 500 words and padding short text.
6
+
7
+ ### Further improvement
8
+
9
+ 1. We may add more features such as title, author information to improve the model.
10
+ 2. The current pre-trained model RoBERTa is still quite slow when trainning on big datasets. Probably we can try other models and compare the runtime and performance of the models.