Spaces:
Running
Running
abdullahmeda
commited on
Update app.py
Browse files
app.py
CHANGED
@@ -136,6 +136,15 @@ with gr.Blocks() as demo:
|
|
136 |
predictions, and is therefore considered to be a better model. The training of LMs is carried out on large-scale text corpora, it can \
|
137 |
be considered that it has learned some common language patterns and text structures. Therefore, PPL can be used to measure how \
|
138 |
well a text conforms to common characteristics.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
139 |
|
140 |
### GLTR: Giant Language Model Test Room
|
141 |
This idea originates from the following paper: arxiv.org/pdf/1906.04043.pdf. It studies 3 tests to compute features of an input text. Their \
|
|
|
136 |
predictions, and is therefore considered to be a better model. The training of LMs is carried out on large-scale text corpora, it can \
|
137 |
be considered that it has learned some common language patterns and text structures. Therefore, PPL can be used to measure how \
|
138 |
well a text conforms to common characteristics.
|
139 |
+
|
140 |
+
I used all variants of the open-source GPT-2 model except xl size to compute the PPL (both text-level and sentence-level PPLs) of the \
|
141 |
+
collected texts. It is observed that, regardless of whether it is at the text level or the sentence level, the content generated by LLMs \
|
142 |
+
have relatively lower PPLs compared to the text written by humans. LLM captured common patterns and structures in the text it was trained on, \
|
143 |
+
and is very good at reproducing them. As a result, text generated by LLMs have relatively concentrated low PPLs.
|
144 |
+
|
145 |
+
Humans have the ability to express themselves in a wide variety of ways, depending on the context, audience, and purpose of the text they are \
|
146 |
+
writing. This can include using creative or imaginative elements, such as metaphors, similes, and unique word choices, which can make it more \
|
147 |
+
difficult for GPT2 to predict.
|
148 |
|
149 |
### GLTR: Giant Language Model Test Room
|
150 |
This idea originates from the following paper: arxiv.org/pdf/1906.04043.pdf. It studies 3 tests to compute features of an input text. Their \
|