pablocosta
commited on
Commit
·
ad1f206
1
Parent(s):
100ada3
Update README.md
Browse files
README.md
CHANGED
@@ -8,6 +8,9 @@ datasets:
|
|
8 |
- Twitter
|
9 |
---
|
10 |
|
|
|
|
|
|
|
11 |
|
12 |
# <a name="introduction"></a> BERTabaporu: a genre-specific pre-trained model of Portuguese-speaking social media
|
13 |
|
@@ -31,6 +34,12 @@ from transformers import AutoModel # or BertModel, for BERT without pretraining
|
|
31 |
model = AutoModelForPreTraining.from_pretrained('pablocosta/bertabaporu-base-uncased')
|
32 |
tokenizer = AutoTokenizer.from_pretrained('pablocosta/bertabaporu-base-uncased')
|
33 |
```
|
|
|
|
|
|
|
|
|
|
|
|
|
34 |
## Cite us
|
35 |
|
36 |
|
|
|
8 |
- Twitter
|
9 |
---
|
10 |
|
11 |
+
**Paper:** For more details, please refer to our paper: [BERTabaporu: Assessing a Genre-Specific Language Model for Portuguese NLP](https://aclanthology.org/2023.ranlp-1.24/)
|
12 |
+
|
13 |
+
|
14 |
|
15 |
# <a name="introduction"></a> BERTabaporu: a genre-specific pre-trained model of Portuguese-speaking social media
|
16 |
|
|
|
34 |
model = AutoModelForPreTraining.from_pretrained('pablocosta/bertabaporu-base-uncased')
|
35 |
tokenizer = AutoTokenizer.from_pretrained('pablocosta/bertabaporu-base-uncased')
|
36 |
```
|
37 |
+
|
38 |
+
|
39 |
+
|
40 |
+
|
41 |
+
|
42 |
+
|
43 |
## Cite us
|
44 |
|
45 |
|