Jón Daðason
commited on
Commit
·
191173e
1
Parent(s):
2fafb1d
Updated README.md
Browse files
README.md
CHANGED
@@ -6,14 +6,14 @@ license: cc-by-4.0
|
|
6 |
datasets:
|
7 |
- igc
|
8 |
- ic3
|
9 |
-
-
|
10 |
- mc4
|
11 |
---
|
12 |
|
13 |
# Icelandic-Norwegian ELECTRA-Small
|
14 |
This model was pretrained on the following corpora:
|
15 |
* The [Icelandic Gigaword Corpus](http://igc.arnastofnun.is/) (IGC)
|
16 |
-
* The
|
17 |
* The [Icelandic Crawled Corpus](https://huggingface.co/datasets/jonfd/ICC) (ICC)
|
18 |
* The [Multilingual Colossal Clean Crawled Corpus](https://huggingface.co/datasets/mc4) (mC4) - Icelandic and Norwegian text obtained from .is and .no domains, respectively
|
19 |
|
|
|
6 |
datasets:
|
7 |
- igc
|
8 |
- ic3
|
9 |
+
- jonfd/ICC
|
10 |
- mc4
|
11 |
---
|
12 |
|
13 |
# Icelandic-Norwegian ELECTRA-Small
|
14 |
This model was pretrained on the following corpora:
|
15 |
* The [Icelandic Gigaword Corpus](http://igc.arnastofnun.is/) (IGC)
|
16 |
+
* The Icelandic Common Crawl Corpus (IC3)
|
17 |
* The [Icelandic Crawled Corpus](https://huggingface.co/datasets/jonfd/ICC) (ICC)
|
18 |
* The [Multilingual Colossal Clean Crawled Corpus](https://huggingface.co/datasets/mc4) (mC4) - Icelandic and Norwegian text obtained from .is and .no domains, respectively
|
19 |
|