derek-harnett commited on
Commit
91db81c
·
verified ·
1 Parent(s): 8f91a0d

update README

Browse files
Files changed (1) hide show
  1. README.md +8 -10
README.md CHANGED
@@ -10,27 +10,24 @@ model-index:
10
  results: []
11
  ---
12
 
13
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
14
- should probably proofread and complete it, then remove this comment. -->
15
-
16
  # movie-review-classifier
17
 
18
- This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on an unknown dataset.
19
- It achieves the following results on the evaluation set:
20
- - Loss: 0.2743
21
- - F1: 0.9327
22
 
23
  ## Model description
24
 
25
- More information needed
 
 
 
26
 
27
  ## Intended uses & limitations
28
 
29
- More information needed
30
 
31
  ## Training and evaluation data
32
 
33
- More information needed
34
 
35
  ## Training procedure
36
 
@@ -44,6 +41,7 @@ The following hyperparameters were used during training:
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: linear
46
  - num_epochs: 3
 
47
 
48
  ### Training results
49
 
 
10
  results: []
11
  ---
12
 
 
 
 
13
  # movie-review-classifier
14
 
15
+ This model classifies (text) movie reviews as either a 1 (*i.e.,* thumbs-up) or a 0 (*i.e.,* a thumbs-down).
 
 
 
16
 
17
  ## Model description
18
 
19
+ This model is a version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) that was fine-tuned on the [IMDB movie-review dataset](https://huggingface.co/datasets/stanfordnlp/imdb).
20
+ It achieves the following results on the evaluation set:
21
+ - Loss: 0.2743
22
+ - F1: 0.9327
23
 
24
  ## Intended uses & limitations
25
 
26
+ Training this model was completed as part of a project from a data science bootcamp. It is intended to be used perhaps by students and/or hobbyists.
27
 
28
  ## Training and evaluation data
29
 
30
+ This model was trained on the [IMDB movie-review dataset](https://huggingface.co/datasets/stanfordnlp/imdb), a set of highly polarized (*i.e.,* clearly positive or negative) movie reviews. The dataset contains 25k labelled train samples, 25k labelled test samples, and 50k unlabelled samples.
31
 
32
  ## Training procedure
33
 
 
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
43
  - num_epochs: 3
44
+ - weight_decay: 0.1
45
 
46
  ### Training results
47