Brian314 commited on
Commit
51815d7
·
1 Parent(s): 45ba1b3

Initial commit

Browse files
README.md ADDED
@@ -0,0 +1,152 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - latex-ocr
4
+ - math-ocr
5
+ - math-formula-recognition
6
+ - mfr
7
+ - pix2text
8
+ - p2t
9
+ - image-to-text
10
+ license: mit
11
+ library_name: transformers
12
+ ---
13
+
14
+ # Model Card: Pix2Text-MFR
15
+ Mathematical Formula Recognition (MFR) model from [Pix2Text (P2T)](https://github.com/breezedeus/Pix2Text).
16
+
17
+ ## Model Details / 模型细节
18
+
19
+ This MFR model utilizes the [TrOCR](https://huggingface.co/docs/transformers/model_doc/trocr) architecture developed by Microsoft, starting with its initial values and retrained using a dataset of mathematical formula images.
20
+ The resulting MFR model can be used to convert images of mathematical formulas into LaTeX text representation. More detailed can be found: [Pix2Text V1.0 New Release: The Best Open-Source Formula Recognition Model | Breezedeus.com](https://www.breezedeus.com/article/p2t-v1.0).
21
+
22
+
23
+ 此 MFR 模型使用了微软的 [TrOCR](https://huggingface.co/docs/transformers/model_doc/trocr) 架构,以其为初始值并利用数学公式图片数据集进行了重新训练。
24
+ 获得的 MFR 模型可用于把数学公式图片转换为 LaTeX 文本表示。更多细节请见:[Pix2Text V1.0 新版发布:最好的开源公式识别模型 | Breezedeus.com](https://www.breezedeus.com/article/p2t-v1.0)。
25
+
26
+
27
+
28
+ ## Usage and Limitations / 使用和限制
29
+
30
+ - **Purpose**: This model is a mathematical formula recognition model, capable of converting input images of mathematical formulas into LaTeX text representation.
31
+ - **Limitation**: Since the model is trained on images of mathematical formulas, it may not work when recognizing other types of images.
32
+
33
+
34
+ - **用途**:此模型为数学公式识别模型,它可以把输入的数学公式图片转换为 LaTeX 文本表示。
35
+ - **限制**:由于模型是在数学公式图片数据上训练的,它在识别其他类型的图片时可能无法工作。
36
+
37
+
38
+
39
+ ## Documents / 文档
40
+
41
+ - [Pix2Text V1.0 New Release: The Best Open-Source Formula Recognition Model | Breezedeus.com](https://www.breezedeus.com/article/p2t-v1.0) ;
42
+ - Pix2Text (P2T) Github: [breezedeus/pix2text](https://github.com/breezedeus/Pix2Text) ;
43
+ - Pix2Text Online Free Service: [p2t.breezedeus.com](https://p2t.breezedeus.com/) ;
44
+ - Pix2Text Online Docs: [Docs](https://pix2text.readthedocs.io) ;
45
+ - Pix2Text More: [breezedeus.com/pix2text](https://breezedeus.com/article/pix2text) ;
46
+ - Pix2Text Discard: https://discord.gg/GgD87WM8Tf
47
+
48
+
49
+ ## Examples / 示例
50
+
51
+ ### Printed Math Formula Images / 印刷体公式图片
52
+
53
+ ![printed-formula examples](https://www.notion.so/image/https%3A%2F%2Fprod-files-secure.s3.us-west-2.amazonaws.com%2F9341931a-53f0-48e1-b026-0f1ad17b457c%2F26046b54-ae87-4faa-ab18-9acda74fd920%2FUntitled.jpeg?table=block&id=f422e590-4465-4648-8edd-ce2e1b00d959)
54
+
55
+
56
+ ### Handwritten Math Formula Images / 印刷体公式图片
57
+
58
+ ![handwritten-formula examples](https://www.notion.so/image/https%3A%2F%2Fprod-files-secure.s3.us-west-2.amazonaws.com%2F9341931a-53f0-48e1-b026-0f1ad17b457c%2Fcdbebff6-5b01-4e2a-a6f0-250da5cc39fe%2FUntitled.jpeg?table=block&id=e1029b05-25c5-40f0-9c3e-505744c0afa5)
59
+
60
+
61
+ ## Model Use / 模型使用
62
+
63
+ ### Method 1: Using the model Directly
64
+
65
+ This method doesn't need to install pix2text, but can only recognize pure formula images.
66
+
67
+ 这种方法无需安装 pix2text,但只能识别纯公式图片。
68
+
69
+ ```python
70
+ #! pip install transformers>=4.37.0 pillow optimum[onnxruntime]
71
+ from PIL import Image
72
+ from transformers import TrOCRProcessor
73
+ from optimum.onnxruntime import ORTModelForVision2Seq
74
+
75
+ processor = TrOCRProcessor.from_pretrained('breezedeus/pix2text-mfr')
76
+ model = ORTModelForVision2Seq.from_pretrained('breezedeus/pix2text-mfr', use_cache=False)
77
+
78
+ image_fps = [
79
+ 'examples/example.jpg',
80
+ 'examples/42.png',
81
+ 'examples/0000186.png',
82
+ ]
83
+ images = [Image.open(fp).convert('RGB') for fp in image_fps]
84
+ pixel_values = processor(images=images, return_tensors="pt").pixel_values
85
+ generated_ids = model.generate(pixel_values)
86
+ generated_text = processor.batch_decode(generated_ids, skip_special_tokens=True)
87
+ print(f'generated_ids: {generated_ids}, \ngenerated text: {generated_text}')
88
+
89
+ ```
90
+
91
+ ### Method 2: Using Pix2Text
92
+
93
+ This method requires the installation of pix2text, utilizing the Mathematical Formula Detection model (MFD) within Pix2Text. It is capable of recognizing not only pure formula images but also mixed images containing text.
94
+
95
+ 这种方法需要安装 pix2text,借助 Pix2Text 中的数学公式检测模型(MFD),它不仅可以识别纯公式图片,还可以识别包含文本的混合图片。
96
+
97
+ ```bash
98
+ $ pip install pix2text>=1.1
99
+ ```
100
+
101
+ ```python
102
+ #! pip install pix2text>=1.1
103
+
104
+ from pix2text import Pix2Text, merge_line_texts
105
+
106
+ image_fps = [
107
+ 'examples/example.jpg',
108
+ 'examples/42.png',
109
+ 'examples/0000186.png',
110
+ ]
111
+ p2t = Pix2Text.from_config()
112
+ outs = p2t.recognize_formula(image_fps) # recognize pure formula images
113
+
114
+ outs2 = p2t.recognize('examples/mixed.jpg', file_type='text_formula', return_text=True, save_analysis_res='mixed-out.jpg') # recognize mixed images
115
+ print(outs2)
116
+ ```
117
+
118
+ ### Method 3: Notebook
119
+
120
+ Just try Pix2Text with this notebook: [https://github.com/breezedeus/Pix2Text/blob/main/pix2text_v1_1.ipynb](https://github.com/breezedeus/Pix2Text/blob/main/pix2text_v1_1.ipynb).
121
+
122
+
123
+ ## Performance / 性能
124
+
125
+ The original images for the test data are derived from real data uploaded by users on the [Pix2Text Online Service](https://p2t.breezedeus.com). Initially, real user data from a specific period is selected, and then the Mathematical Formula Detection model (MFD) within Pix2Text is used to detect the mathematical formulas in these images and crop the corresponding parts. A subset of these formula images is then randomly chosen for manual annotation to create the test dataset. The following image shows some sample pictures from the test dataset. It is evident that the images in the test dataset are quite diverse, including mathematical formulas of various lengths and complexities, from single letters to formula groups and even matrices. This test dataset includes `485` images.
126
+
127
+ 测试数据对应的原始图片来源于 [Pix2Text 网页版](https://p2t.breezedeus.com) 用户上传的真实数据。首先选取一段时间内用户的真实数据,然后利用 Pix2Text 中数学公式检测模型(MFD)检测出这些图片中的数学公式并截取出对应的部分,再从中随机选取部分公式图片进行人工标注。就获得了用于测试的测试数据集了。下图是测试数据集中的部分样例图片。从中可以看出测试数据集中的图片比较多样,包括了各种不同长度和复杂度的数学公式,有单个字母的图片,也有公式组甚至矩阵图片。本测试数据集包括了 `485` 张图片。
128
+
129
+ ![Examples from test data](https://www.notion.so/image/https%3A%2F%2Fprod-files-secure.s3.us-west-2.amazonaws.com%2F9341931a-53f0-48e1-b026-0f1ad17b457c%2Ffb23b2d4-cdcf-46c9-9095-027591402a54%2FUntitled.png?table=block&id=269900d5-299a-4dcd-a26c-6555e831caff)
130
+
131
+ Below are the Character Error Rates (CER, the lower, the better) of various models on this test dataset. For the true annotated results, as well as the output of each model, normalization was first performed to ensure that irrelevant factors such as spaces do not affect the test outcomes. For the recognition results of Texify, the leading and trailing symbols `$` or `$$` of the formula are removed first.
132
+
133
+ 以下是各个模型在此测试数据集上的 CER(字错误率,越小越好)。其中对真实标注结果,以及每个模型的输出都首先进行了标准化,以保证不会因为空格等无关因素影响测试结果。对 Texify 的识别结果会首先去掉公式的首尾符号$或$$。
134
+
135
+ ![CER Comparison Among Different MFR Models](https://www.notion.so/image/https%3A%2F%2Fprod-files-secure.s3.us-west-2.amazonaws.com%2F9341931a-53f0-48e1-b026-0f1ad17b457c%2F976b6c14-879d-4a3b-b027-6d2b15ce28b3%2FUntitled.png?table=block&id=6c503402-9b34-4937-a103-e4fd3bdbe754)
136
+
137
+ As can be seen from the figure above, the Pix2Text V1.0 MFR open-source free version model has significantly outperformed the previous versions of the paid model. Moreover, compared to the V1.0 MFR open-source free model, the precision of the Pix2Text V1.0 MFR paid model has been further improved.
138
+
139
+ 由上图可见,Pix2Text V1.0 MFR 开源免费版模型已经大大优于之前版本的付费模型。而相比 V1.0 MFR 开源免费模型,Pix2Text V1.0 MFR 付费模型精度得到了进一步的提升。
140
+
141
+ > [Texify](https://github.com/VikParuchuri/texify) is more suited for recognizing images with standard formatting. It performs poorly in recognizing images containing single letters. This is the main reason why Texify's performance on this test dataset is inferior to that of Latex-OCR.
142
+ >
143
+ > [Texify](https://github.com/VikParuchuri/texify) 更适用于识别标准排版的图片,它对包含单字母的图片识别较差。这也是 Texify 在此测试数据集上效果比 Latex-OCR 还差的主要原因。
144
+
145
+
146
+ ## Feedback / 反馈
147
+
148
+ > Where to send questions or comments about the model.
149
+
150
+ Welcome to contact the author [Breezedeus](https://www.breezedeus.com/article/join-group).
151
+
152
+ 欢迎联系作者 [Breezedeus](https://www.breezedeus.com/article/join-group) 。
config.json ADDED
@@ -0,0 +1,170 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_name_or_path": "Brian314/pix2text-mfr-quantized",
3
+ "architectures": [
4
+ "VisionEncoderDecoderModel"
5
+ ],
6
+ "decoder": {
7
+ "_name_or_path": "",
8
+ "activation_dropout": 0.0,
9
+ "activation_function": "relu",
10
+ "add_cross_attention": true,
11
+ "architectures": null,
12
+ "attention_dropout": 0.0,
13
+ "bad_words_ids": null,
14
+ "begin_suppress_tokens": null,
15
+ "bos_token_id": 0,
16
+ "chunk_size_feed_forward": 0,
17
+ "classifier_dropout": 0.0,
18
+ "cross_attention_hidden_size": 384,
19
+ "d_model": 256,
20
+ "decoder_attention_heads": 8,
21
+ "decoder_ffn_dim": 1024,
22
+ "decoder_layerdrop": 0.0,
23
+ "decoder_layers": 6,
24
+ "decoder_start_token_id": 2,
25
+ "diversity_penalty": 0.0,
26
+ "do_sample": false,
27
+ "dropout": 0.1,
28
+ "early_stopping": false,
29
+ "encoder_no_repeat_ngram_size": 0,
30
+ "eos_token_id": 2,
31
+ "exponential_decay_length_penalty": null,
32
+ "finetuning_task": null,
33
+ "forced_bos_token_id": null,
34
+ "forced_eos_token_id": null,
35
+ "id2label": {
36
+ "0": "LABEL_0",
37
+ "1": "LABEL_1"
38
+ },
39
+ "init_std": 0.02,
40
+ "is_decoder": true,
41
+ "is_encoder_decoder": false,
42
+ "label2id": {
43
+ "LABEL_0": 0,
44
+ "LABEL_1": 1
45
+ },
46
+ "layernorm_embedding": true,
47
+ "length_penalty": 1.0,
48
+ "max_length": 20,
49
+ "max_position_embeddings": 512,
50
+ "min_length": 0,
51
+ "model_type": "trocr",
52
+ "no_repeat_ngram_size": 0,
53
+ "num_beam_groups": 1,
54
+ "num_beams": 1,
55
+ "num_return_sequences": 1,
56
+ "output_attentions": false,
57
+ "output_hidden_states": false,
58
+ "output_scores": false,
59
+ "pad_token_id": 1,
60
+ "prefix": null,
61
+ "problem_type": null,
62
+ "pruned_heads": {},
63
+ "remove_invalid_values": false,
64
+ "repetition_penalty": 1.0,
65
+ "return_dict": true,
66
+ "return_dict_in_generate": false,
67
+ "scale_embedding": true,
68
+ "sep_token_id": null,
69
+ "suppress_tokens": null,
70
+ "task_specific_params": null,
71
+ "temperature": 1.0,
72
+ "tf_legacy_loss": false,
73
+ "tie_encoder_decoder": false,
74
+ "tie_word_embeddings": false,
75
+ "tokenizer_class": null,
76
+ "top_k": 50,
77
+ "top_p": 1.0,
78
+ "torch_dtype": null,
79
+ "torchscript": false,
80
+ "typical_p": 1.0,
81
+ "use_bfloat16": false,
82
+ "use_cache": false,
83
+ "use_learned_position_embeddings": true,
84
+ "vocab_size": 1200
85
+ },
86
+ "decoder_start_token_id": 2,
87
+ "encoder": {
88
+ "_name_or_path": "",
89
+ "add_cross_attention": false,
90
+ "architectures": null,
91
+ "attention_probs_dropout_prob": 0.0,
92
+ "bad_words_ids": null,
93
+ "begin_suppress_tokens": null,
94
+ "bos_token_id": null,
95
+ "chunk_size_feed_forward": 0,
96
+ "cross_attention_hidden_size": null,
97
+ "decoder_start_token_id": null,
98
+ "diversity_penalty": 0.0,
99
+ "do_sample": false,
100
+ "early_stopping": false,
101
+ "encoder_no_repeat_ngram_size": 0,
102
+ "encoder_stride": 16,
103
+ "eos_token_id": null,
104
+ "exponential_decay_length_penalty": null,
105
+ "finetuning_task": null,
106
+ "forced_bos_token_id": null,
107
+ "forced_eos_token_id": null,
108
+ "hidden_act": "gelu",
109
+ "hidden_dropout_prob": 0.0,
110
+ "hidden_size": 384,
111
+ "id2label": {
112
+ "0": "LABEL_0",
113
+ "1": "LABEL_1"
114
+ },
115
+ "image_size": 384,
116
+ "initializer_range": 0.02,
117
+ "intermediate_size": 1536,
118
+ "is_decoder": false,
119
+ "is_encoder_decoder": false,
120
+ "label2id": {
121
+ "LABEL_0": 0,
122
+ "LABEL_1": 1
123
+ },
124
+ "layer_norm_eps": 1e-12,
125
+ "length_penalty": 1.0,
126
+ "max_length": 20,
127
+ "min_length": 0,
128
+ "model_type": "deit",
129
+ "no_repeat_ngram_size": 0,
130
+ "num_attention_heads": 6,
131
+ "num_beam_groups": 1,
132
+ "num_beams": 1,
133
+ "num_channels": 3,
134
+ "num_hidden_layers": 12,
135
+ "num_return_sequences": 1,
136
+ "output_attentions": false,
137
+ "output_hidden_states": false,
138
+ "output_scores": false,
139
+ "pad_token_id": null,
140
+ "patch_size": 16,
141
+ "prefix": null,
142
+ "problem_type": null,
143
+ "pruned_heads": {},
144
+ "qkv_bias": true,
145
+ "remove_invalid_values": false,
146
+ "repetition_penalty": 1.0,
147
+ "return_dict": true,
148
+ "return_dict_in_generate": false,
149
+ "sep_token_id": null,
150
+ "suppress_tokens": null,
151
+ "task_specific_params": null,
152
+ "temperature": 1.0,
153
+ "tf_legacy_loss": false,
154
+ "tie_encoder_decoder": false,
155
+ "tie_word_embeddings": true,
156
+ "tokenizer_class": null,
157
+ "top_k": 50,
158
+ "top_p": 1.0,
159
+ "torch_dtype": null,
160
+ "torchscript": false,
161
+ "typical_p": 1.0,
162
+ "use_bfloat16": false
163
+ },
164
+ "eos_token_id": 2,
165
+ "is_encoder_decoder": true,
166
+ "model_type": "vision-encoder-decoder",
167
+ "pad_token_id": 0,
168
+ "tie_word_embeddings": false,
169
+ "transformers_version": "4.37.0"
170
+ }
decoder_model.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fd0f92d7a012f3dae41e1ac79421aea0ea888b5a66cb3f9a004e424f82f3daed
3
+ size 30114937
encoder_model.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bc981519ead2f17d9fac2b001cf525928f58fe86ba423dc607cfa068df9540d5
3
+ size 23083150
generation_config.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_from_model_config": true,
3
+ "bos_token_id": 1,
4
+ "decoder_start_token_id": 2,
5
+ "eos_token_id": 2,
6
+ "max_new_tokens": 512,
7
+ "pad_token_id": 0,
8
+ "transformers_version": "4.37.0",
9
+ "use_cache": false
10
+ }
preprocessor_config.json ADDED
@@ -0,0 +1,28 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "crop_size": {
3
+ "height": 224,
4
+ "width": 224
5
+ },
6
+ "do_center_crop": false,
7
+ "do_normalize": true,
8
+ "do_rescale": true,
9
+ "do_resize": true,
10
+ "image_mean": [
11
+ 0.5,
12
+ 0.5,
13
+ 0.5
14
+ ],
15
+ "image_processor_type": "DeiTImageProcessor",
16
+ "image_std": [
17
+ 0.5,
18
+ 0.5,
19
+ 0.5
20
+ ],
21
+ "processor_class": "TrOCRProcessor",
22
+ "resample": 3,
23
+ "rescale_factor": 0.00392156862745098,
24
+ "size": {
25
+ "height": 384,
26
+ "width": 384
27
+ }
28
+ }
special_tokens_map.json ADDED
@@ -0,0 +1,51 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "bos_token": {
3
+ "content": "<s>",
4
+ "lstrip": false,
5
+ "normalized": false,
6
+ "rstrip": false,
7
+ "single_word": false
8
+ },
9
+ "cls_token": {
10
+ "content": "<s>",
11
+ "lstrip": false,
12
+ "normalized": false,
13
+ "rstrip": false,
14
+ "single_word": false
15
+ },
16
+ "eos_token": {
17
+ "content": "</s>",
18
+ "lstrip": false,
19
+ "normalized": false,
20
+ "rstrip": false,
21
+ "single_word": false
22
+ },
23
+ "mask_token": {
24
+ "content": "<mask>",
25
+ "lstrip": true,
26
+ "normalized": false,
27
+ "rstrip": false,
28
+ "single_word": false
29
+ },
30
+ "pad_token": {
31
+ "content": "<pad>",
32
+ "lstrip": false,
33
+ "normalized": false,
34
+ "rstrip": false,
35
+ "single_word": false
36
+ },
37
+ "sep_token": {
38
+ "content": "</s>",
39
+ "lstrip": false,
40
+ "normalized": false,
41
+ "rstrip": false,
42
+ "single_word": false
43
+ },
44
+ "unk_token": {
45
+ "content": "<unk>",
46
+ "lstrip": false,
47
+ "normalized": false,
48
+ "rstrip": false,
49
+ "single_word": false
50
+ }
51
+ }
tokenizer.json ADDED
@@ -0,0 +1,2218 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "version": "1.0",
3
+ "truncation": null,
4
+ "padding": null,
5
+ "added_tokens": [
6
+ {
7
+ "id": 0,
8
+ "content": "<pad>",
9
+ "single_word": false,
10
+ "lstrip": false,
11
+ "rstrip": false,
12
+ "normalized": false,
13
+ "special": true
14
+ },
15
+ {
16
+ "id": 1,
17
+ "content": "<s>",
18
+ "single_word": false,
19
+ "lstrip": false,
20
+ "rstrip": false,
21
+ "normalized": false,
22
+ "special": true
23
+ },
24
+ {
25
+ "id": 2,
26
+ "content": "</s>",
27
+ "single_word": false,
28
+ "lstrip": false,
29
+ "rstrip": false,
30
+ "normalized": false,
31
+ "special": true
32
+ },
33
+ {
34
+ "id": 3,
35
+ "content": "<unk>",
36
+ "single_word": false,
37
+ "lstrip": false,
38
+ "rstrip": false,
39
+ "normalized": false,
40
+ "special": true
41
+ },
42
+ {
43
+ "id": 4,
44
+ "content": "<mask>",
45
+ "single_word": false,
46
+ "lstrip": true,
47
+ "rstrip": false,
48
+ "normalized": false,
49
+ "special": true
50
+ }
51
+ ],
52
+ "normalizer": null,
53
+ "pre_tokenizer": {
54
+ "type": "ByteLevel",
55
+ "add_prefix_space": false,
56
+ "trim_offsets": true,
57
+ "use_regex": true
58
+ },
59
+ "post_processor": null,
60
+ "decoder": {
61
+ "type": "ByteLevel",
62
+ "add_prefix_space": true,
63
+ "trim_offsets": true,
64
+ "use_regex": true
65
+ },
66
+ "model": {
67
+ "type": "BPE",
68
+ "dropout": null,
69
+ "unk_token": null,
70
+ "continuing_subword_prefix": null,
71
+ "end_of_word_suffix": null,
72
+ "fuse_unk": false,
73
+ "byte_fallback": false,
74
+ "vocab": {
75
+ "<pad>": 0,
76
+ "<s>": 1,
77
+ "</s>": 2,
78
+ "<unk>": 3,
79
+ "<mask>": 4,
80
+ "!": 5,
81
+ "\"": 6,
82
+ "#": 7,
83
+ "$": 8,
84
+ "%": 9,
85
+ "&": 10,
86
+ "'": 11,
87
+ "(": 12,
88
+ ")": 13,
89
+ "*": 14,
90
+ "+": 15,
91
+ ",": 16,
92
+ "-": 17,
93
+ ".": 18,
94
+ "/": 19,
95
+ "0": 20,
96
+ "1": 21,
97
+ "2": 22,
98
+ "3": 23,
99
+ "4": 24,
100
+ "5": 25,
101
+ "6": 26,
102
+ "7": 27,
103
+ "8": 28,
104
+ "9": 29,
105
+ ":": 30,
106
+ ";": 31,
107
+ "<": 32,
108
+ "=": 33,
109
+ ">": 34,
110
+ "?": 35,
111
+ "@": 36,
112
+ "A": 37,
113
+ "B": 38,
114
+ "C": 39,
115
+ "D": 40,
116
+ "E": 41,
117
+ "F": 42,
118
+ "G": 43,
119
+ "H": 44,
120
+ "I": 45,
121
+ "J": 46,
122
+ "K": 47,
123
+ "L": 48,
124
+ "M": 49,
125
+ "N": 50,
126
+ "O": 51,
127
+ "P": 52,
128
+ "Q": 53,
129
+ "R": 54,
130
+ "S": 55,
131
+ "T": 56,
132
+ "U": 57,
133
+ "V": 58,
134
+ "W": 59,
135
+ "X": 60,
136
+ "Y": 61,
137
+ "Z": 62,
138
+ "[": 63,
139
+ "\\": 64,
140
+ "]": 65,
141
+ "^": 66,
142
+ "_": 67,
143
+ "`": 68,
144
+ "a": 69,
145
+ "b": 70,
146
+ "c": 71,
147
+ "d": 72,
148
+ "e": 73,
149
+ "f": 74,
150
+ "g": 75,
151
+ "h": 76,
152
+ "i": 77,
153
+ "j": 78,
154
+ "k": 79,
155
+ "l": 80,
156
+ "m": 81,
157
+ "n": 82,
158
+ "o": 83,
159
+ "p": 84,
160
+ "q": 85,
161
+ "r": 86,
162
+ "s": 87,
163
+ "t": 88,
164
+ "u": 89,
165
+ "v": 90,
166
+ "w": 91,
167
+ "x": 92,
168
+ "y": 93,
169
+ "z": 94,
170
+ "{": 95,
171
+ "|": 96,
172
+ "}": 97,
173
+ "~": 98,
174
+ "¡": 99,
175
+ "¢": 100,
176
+ "£": 101,
177
+ "¤": 102,
178
+ "¥": 103,
179
+ "¦": 104,
180
+ "§": 105,
181
+ "¨": 106,
182
+ "©": 107,
183
+ "ª": 108,
184
+ "«": 109,
185
+ "¬": 110,
186
+ "®": 111,
187
+ "¯": 112,
188
+ "°": 113,
189
+ "±": 114,
190
+ "²": 115,
191
+ "³": 116,
192
+ "´": 117,
193
+ "µ": 118,
194
+ "¶": 119,
195
+ "·": 120,
196
+ "¸": 121,
197
+ "¹": 122,
198
+ "º": 123,
199
+ "»": 124,
200
+ "¼": 125,
201
+ "½": 126,
202
+ "¾": 127,
203
+ "¿": 128,
204
+ "À": 129,
205
+ "Á": 130,
206
+ "Â": 131,
207
+ "Ã": 132,
208
+ "Ä": 133,
209
+ "Å": 134,
210
+ "Æ": 135,
211
+ "Ç": 136,
212
+ "È": 137,
213
+ "É": 138,
214
+ "Ê": 139,
215
+ "Ë": 140,
216
+ "Ì": 141,
217
+ "Í": 142,
218
+ "Î": 143,
219
+ "Ï": 144,
220
+ "Ð": 145,
221
+ "Ñ": 146,
222
+ "Ò": 147,
223
+ "Ó": 148,
224
+ "Ô": 149,
225
+ "Õ": 150,
226
+ "Ö": 151,
227
+ "×": 152,
228
+ "Ø": 153,
229
+ "Ù": 154,
230
+ "Ú": 155,
231
+ "Û": 156,
232
+ "Ü": 157,
233
+ "Ý": 158,
234
+ "Þ": 159,
235
+ "ß": 160,
236
+ "à": 161,
237
+ "á": 162,
238
+ "â": 163,
239
+ "ã": 164,
240
+ "ä": 165,
241
+ "å": 166,
242
+ "æ": 167,
243
+ "ç": 168,
244
+ "è": 169,
245
+ "é": 170,
246
+ "ê": 171,
247
+ "ë": 172,
248
+ "ì": 173,
249
+ "í": 174,
250
+ "î": 175,
251
+ "ï": 176,
252
+ "ð": 177,
253
+ "ñ": 178,
254
+ "ò": 179,
255
+ "ó": 180,
256
+ "ô": 181,
257
+ "õ": 182,
258
+ "ö": 183,
259
+ "÷": 184,
260
+ "ø": 185,
261
+ "ù": 186,
262
+ "ú": 187,
263
+ "û": 188,
264
+ "ü": 189,
265
+ "ý": 190,
266
+ "þ": 191,
267
+ "ÿ": 192,
268
+ "Ā": 193,
269
+ "ā": 194,
270
+ "Ă": 195,
271
+ "ă": 196,
272
+ "Ą": 197,
273
+ "ą": 198,
274
+ "Ć": 199,
275
+ "ć": 200,
276
+ "Ĉ": 201,
277
+ "ĉ": 202,
278
+ "Ċ": 203,
279
+ "ċ": 204,
280
+ "Č": 205,
281
+ "č": 206,
282
+ "Ď": 207,
283
+ "ď": 208,
284
+ "Đ": 209,
285
+ "đ": 210,
286
+ "Ē": 211,
287
+ "ē": 212,
288
+ "Ĕ": 213,
289
+ "ĕ": 214,
290
+ "Ė": 215,
291
+ "ė": 216,
292
+ "Ę": 217,
293
+ "ę": 218,
294
+ "Ě": 219,
295
+ "ě": 220,
296
+ "Ĝ": 221,
297
+ "ĝ": 222,
298
+ "Ğ": 223,
299
+ "ğ": 224,
300
+ "Ġ": 225,
301
+ "ġ": 226,
302
+ "Ģ": 227,
303
+ "ģ": 228,
304
+ "Ĥ": 229,
305
+ "ĥ": 230,
306
+ "Ħ": 231,
307
+ "ħ": 232,
308
+ "Ĩ": 233,
309
+ "ĩ": 234,
310
+ "Ī": 235,
311
+ "ī": 236,
312
+ "Ĭ": 237,
313
+ "ĭ": 238,
314
+ "Į": 239,
315
+ "į": 240,
316
+ "İ": 241,
317
+ "ı": 242,
318
+ "IJ": 243,
319
+ "ij": 244,
320
+ "Ĵ": 245,
321
+ "ĵ": 246,
322
+ "Ķ": 247,
323
+ "ķ": 248,
324
+ "ĸ": 249,
325
+ "Ĺ": 250,
326
+ "ĺ": 251,
327
+ "Ļ": 252,
328
+ "ļ": 253,
329
+ "Ľ": 254,
330
+ "ľ": 255,
331
+ "Ŀ": 256,
332
+ "ŀ": 257,
333
+ "Ł": 258,
334
+ "ł": 259,
335
+ "Ń": 260,
336
+ "Ġ}": 261,
337
+ "Ġ{": 262,
338
+ "Ġ\\": 263,
339
+ "Ġ_": 264,
340
+ "Ġ^": 265,
341
+ "Ġ)": 266,
342
+ "Ġ(": 267,
343
+ "Ġ1": 268,
344
+ "Ġ2": 269,
345
+ "Ġ=": 270,
346
+ "Ġ,": 271,
347
+ "Ġ-": 272,
348
+ "ra": 273,
349
+ "Ġ+": 274,
350
+ "ma": 275,
351
+ "rac": 276,
352
+ "frac": 277,
353
+ "ta": 278,
354
+ "Ġ0": 279,
355
+ "ig": 280,
356
+ "Ġi": 281,
357
+ "Ġx": 282,
358
+ "al": 283,
359
+ "Ġn": 284,
360
+ "le": 285,
361
+ "ar": 286,
362
+ "me": 287,
363
+ "th": 288,
364
+ "in": 289,
365
+ "Ġ.": 290,
366
+ "ft": 291,
367
+ "ph": 292,
368
+ "Ġt": 293,
369
+ "Ġk": 294,
370
+ "math": 295,
371
+ "Ġd": 296,
372
+ "igh": 297,
373
+ "righ": 298,
374
+ "am": 299,
375
+ "Ġa": 300,
376
+ "ti": 301,
377
+ "eta": 302,
378
+ "left": 303,
379
+ "right": 304,
380
+ "Ġm": 305,
381
+ "Ġj": 306,
382
+ "Ġ\\,": 307,
383
+ "Ġp": 308,
384
+ "de": 309,
385
+ "Ġs": 310,
386
+ "Ġr": 311,
387
+ "Ġ3": 312,
388
+ "Ġe": 313,
389
+ "si": 314,
390
+ "Ġ|": 315,
391
+ "ri": 316,
392
+ "mu": 317,
393
+ "Ġc": 318,
394
+ "Ġ&": 319,
395
+ "pha": 320,
396
+ "alpha": 321,
397
+ "da": 322,
398
+ "Ġl": 323,
399
+ "ot": 324,
400
+ "dot": 325,
401
+ "bda": 326,
402
+ "ambda": 327,
403
+ "Ġu": 328,
404
+ "Ġf": 329,
405
+ "Ġg": 330,
406
+ "ĠA": 331,
407
+ "Ġz": 332,
408
+ "Ġ4": 333,
409
+ "psi": 334,
410
+ "pri": 335,
411
+ "prime": 336,
412
+ "Ġ\\\\": 337,
413
+ "Ġy": 338,
414
+ "par": 339,
415
+ "tial": 340,
416
+ "partial": 341,
417
+ "to": 342,
418
+ "phi": 343,
419
+ "rm": 344,
420
+ "Ġb": 345,
421
+ "mathrm": 346,
422
+ "na": 347,
423
+ "lta": 348,
424
+ "Ġq": 349,
425
+ "su": 350,
426
+ "amma": 351,
427
+ "lambda": 352,
428
+ "cal": 353,
429
+ "ĠN": 354,
430
+ "Ġ~": 355,
431
+ "pi": 356,
432
+ "sum": 357,
433
+ "op": 358,
434
+ "ĠT": 359,
435
+ "nu": 360,
436
+ "Ġ]": 361,
437
+ "lo": 362,
438
+ "var": 363,
439
+ "ope": 364,
440
+ "tor": 365,
441
+ "rator": 366,
442
+ "operator": 367,
443
+ "name": 368,
444
+ "operatorname": 369,
445
+ "int": 370,
446
+ "igma": 371,
447
+ "Ġv": 372,
448
+ "ga": 373,
449
+ "mega": 374,
450
+ "ĠB": 375,
451
+ "ĠR": 376,
452
+ "beta": 377,
453
+ "rt": 378,
454
+ "Ġh": 379,
455
+ "ĠL": 380,
456
+ "ve": 381,
457
+ "Ġ[": 382,
458
+ "Ġ/": 383,
459
+ "lon": 384,
460
+ "bf": 385,
461
+ "ĠM": 386,
462
+ "theta": 387,
463
+ "delta": 388,
464
+ "ĠD": 389,
465
+ "ĠX": 390,
466
+ "ng": 391,
467
+ "Ġ\\;": 392,
468
+ "psilon": 393,
469
+ "bar": 394,
470
+ "sigma": 395,
471
+ "gamma": 396,
472
+ "ngle": 397,
473
+ "ĠC": 398,
474
+ "ĠF": 399,
475
+ "ĠS": 400,
476
+ "lde": 401,
477
+ "tilde": 402,
478
+ "cdot": 403,
479
+ "epsilon": 404,
480
+ "ĠH": 405,
481
+ "sq": 406,
482
+ "ĠP": 407,
483
+ "hi": 408,
484
+ "ho": 409,
485
+ "rho": 410,
486
+ "ray": 411,
487
+ "array": 412,
488
+ "la": 413,
489
+ "ha": 414,
490
+ "hat": 415,
491
+ "be": 416,
492
+ "nd": 417,
493
+ "sqrt": 418,
494
+ "Ġo": 419,
495
+ "end": 420,
496
+ "gin": 421,
497
+ "begin": 422,
498
+ "qu": 423,
499
+ "tau": 424,
500
+ "inft": 425,
501
+ "infty": 426,
502
+ "ĠE": 427,
503
+ "xi": 428,
504
+ "ĠV": 429,
505
+ "ĠI": 430,
506
+ "Ġ5": 431,
507
+ "omega": 432,
508
+ "ĠG": 433,
509
+ "Ġ*": 434,
510
+ "Ġw": 435,
511
+ "mathcal": 436,
512
+ "big": 437,
513
+ "ro": 438,
514
+ "ge": 439,
515
+ "Ġ:": 440,
516
+ "mes": 441,
517
+ "times": 442,
518
+ "leq": 443,
519
+ "ad": 444,
520
+ "ĠQ": 445,
521
+ "quad": 446,
522
+ "ĠK": 447,
523
+ "mathbf": 448,
524
+ "dots": 449,
525
+ "Ġ6": 450,
526
+ "De": 451,
527
+ "Delta": 452,
528
+ "ap": 453,
529
+ "varphi": 454,
530
+ "Ġ\\}": 455,
531
+ "ll": 456,
532
+ "ĠJ": 457,
533
+ "ĠU": 458,
534
+ "Gamma": 459,
535
+ "rangle": 460,
536
+ "Phi": 461,
537
+ "lin": 462,
538
+ "line": 463,
539
+ "tri": 464,
540
+ "rline": 465,
541
+ "Ġ\\{": 466,
542
+ "Ġ;": 467,
543
+ "ĠW": 468,
544
+ "Omega": 469,
545
+ "ĠZ": 470,
546
+ "matri": 471,
547
+ "matrix": 472,
548
+ "ove": 473,
549
+ "Ġ8": 474,
550
+ "Ġ\\|": 475,
551
+ "app": 476,
552
+ "\\{": 477,
553
+ "se": 478,
554
+ "ell": 479,
555
+ "vec": 480,
556
+ "Big": 481,
557
+ "Lambda": 482,
558
+ "pm": 483,
559
+ "bb": 484,
560
+ "mathbb": 485,
561
+ "chi": 486,
562
+ "Ġ<": 487,
563
+ "ĠY": 488,
564
+ "bla": 489,
565
+ "nabla": 490,
566
+ "langle": 491,
567
+ "tar": 492,
568
+ "overline": 493,
569
+ "varepsilon": 494,
570
+ "ĠO": 495,
571
+ "row": 496,
572
+ "Ġ>": 497,
573
+ "wi": 498,
574
+ "wide": 499,
575
+ "ed": 500,
576
+ "appa": 501,
577
+ "kappa": 502,
578
+ "cdots": 503,
579
+ "tarrow": 504,
580
+ "ldots": 505,
581
+ "Psi": 506,
582
+ "zeta": 507,
583
+ "Ġ\\!": 508,
584
+ "geq": 509,
585
+ "Ġ7": 510,
586
+ "ex": 511,
587
+ "rightarrow": 512,
588
+ "iv": 513,
589
+ "otimes": 514,
590
+ "bo": 515,
591
+ "Ġ9": 516,
592
+ "bol": 517,
593
+ "st": 518,
594
+ "qquad": 519,
595
+ "pro": 520,
596
+ "rc": 521,
597
+ "prod": 522,
598
+ "equ": 523,
599
+ "equiv": 524,
600
+ "set": 525,
601
+ "ci": 526,
602
+ "circ": 527,
603
+ "vert": 528,
604
+ "widetilde": 529,
605
+ "ca": 530,
606
+ "ses": 531,
607
+ "cases": 532,
608
+ "Sigma": 533,
609
+ "Ġ!": 534,
610
+ "dag": 535,
611
+ "co": 536,
612
+ "tex": 537,
613
+ "wed": 538,
614
+ "wedge": 539,
615
+ "\\}": 540,
616
+ "bigg": 541,
617
+ "text": 542,
618
+ "un": 543,
619
+ "sin": 544,
620
+ "unde": 545,
621
+ "sim": 546,
622
+ "ger": 547,
623
+ "dagger": 548,
624
+ "eq": 549,
625
+ "ign": 550,
626
+ "align": 551,
627
+ "aligned": 552,
628
+ "us": 553,
629
+ "Pi": 554,
630
+ "cos": 555,
631
+ "up": 556,
632
+ "widehat": 557,
633
+ "cap": 558,
634
+ "mi": 559,
635
+ "ast": 560,
636
+ "Th": 561,
637
+ "Theta": 562,
638
+ "Ġ\\:": 563,
639
+ "bold": 564,
640
+ "ck": 565,
641
+ "neq": 566,
642
+ "mid": 567,
643
+ "sub": 568,
644
+ "cup": 569,
645
+ "om": 570,
646
+ "brac": 571,
647
+ "bin": 572,
648
+ "binom": 573,
649
+ "fra": 574,
650
+ "mathfra": 575,
651
+ "mathfrak": 576,
652
+ "ln": 577,
653
+ "lus": 578,
654
+ "sy": 579,
655
+ "sym": 580,
656
+ "hbar": 581,
657
+ "symbol": 582,
658
+ "boldsymbol": 583,
659
+ "oplus": 584,
660
+ "sp": 585,
661
+ "angle": 586,
662
+ "underline": 587,
663
+ "exp": 588,
664
+ "fo": 589,
665
+ "rox": 590,
666
+ "approx": 591,
667
+ "re": 592,
668
+ "rall": 593,
669
+ "forall": 594,
670
+ "subset": 595,
671
+ "\\|": 596,
672
+ "er": 597,
673
+ "yle": 598,
674
+ "style": 599,
675
+ "brace": 600,
676
+ "Ve": 601,
677
+ "sta": 602,
678
+ "stack": 603,
679
+ "Vert": 604,
680
+ "star": 605,
681
+ "per": 606,
682
+ "perp": 607,
683
+ "io": 608,
684
+ "iota": 609,
685
+ "under": 610,
686
+ "over": 611,
687
+ "vartheta": 612,
688
+ "or": 613,
689
+ "bigl": 614,
690
+ "bigr": 615,
691
+ "textrm": 616,
692
+ "di": 617,
693
+ "not": 618,
694
+ "sf": 619,
695
+ "mp": 620,
696
+ "ac": 621,
697
+ "Xi": 622,
698
+ "log": 623,
699
+ "Bigr": 624,
700
+ "rel": 625,
701
+ "meq": 626,
702
+ "simeq": 627,
703
+ "Bigl": 628,
704
+ "stackrel": 629,
705
+ "flo": 630,
706
+ "floor": 631,
707
+ "lay": 632,
708
+ "splay": 633,
709
+ "display": 634,
710
+ "displaystyle": 635,
711
+ "spac": 636,
712
+ "space": 637,
713
+ "Bigg": 638,
714
+ "inus": 639,
715
+ "varrho": 640,
716
+ "vdots": 641,
717
+ "long": 642,
718
+ "Ġ'": 643,
719
+ "minus": 644,
720
+ "textbf": 645,
721
+ "setminus": 646,
722
+ "biggr": 647,
723
+ "biggl": 648,
724
+ "div": 649,
725
+ "mathsf": 650,
726
+ "Upsilon": 651,
727
+ "sc": 652,
728
+ "sla": 653,
729
+ "hspace": 654,
730
+ "lle": 655,
731
+ "sh": 656,
732
+ "tan": 657,
733
+ "longrightarrow": 658,
734
+ "triangle": 659,
735
+ "pt": 660,
736
+ "ript": 661,
737
+ "script": 662,
738
+ "ddot": 663,
739
+ "underbrace": 664,
740
+ "sinh": 665,
741
+ "brack": 666,
742
+ "sto": 667,
743
+ "psto": 668,
744
+ "mapsto": 669,
745
+ "ch": 670,
746
+ "top": 671,
747
+ "substack": 672,
748
+ "leftrightarrow": 673,
749
+ "eck": 674,
750
+ "check": 675,
751
+ "cosh": 676,
752
+ "bigcup": 677,
753
+ "lor": 678,
754
+ "sup": 679,
755
+ "supset": 680,
756
+ "oint": 681,
757
+ "bu": 682,
758
+ "varpi": 683,
759
+ "rfloor": 684,
760
+ "li": 685,
761
+ "notin": 686,
762
+ "upsilon": 687,
763
+ "lfloor": 688,
764
+ "sqcup": 689,
765
+ "Ġ\\#": 690,
766
+ "vee": 691,
767
+ "subseteq": 692,
768
+ "boldmath": 693,
769
+ "od": 694,
770
+ "parrow": 695,
771
+ "textstyle": 696,
772
+ "pr": 697,
773
+ "arrow": 698,
774
+ "ddots": 699,
775
+ "colon": 700,
776
+ "uparrow": 701,
777
+ "bigoplus": 702,
778
+ "llet": 703,
779
+ "bullet": 704,
780
+ "lim": 705,
781
+ "nt": 706,
782
+ "opto": 707,
783
+ "propto": 708,
784
+ "overrightarrow": 709,
785
+ "slant": 710,
786
+ "overbrace": 711,
787
+ "it": 712,
788
+ "Righ": 713,
789
+ "Rightarrow": 714,
790
+ "ow": 715,
791
+ "own": 716,
792
+ "il": 717,
793
+ "bot": 718,
794
+ "slash": 719,
795
+ "underset": 720,
796
+ "down": 721,
797
+ "imath": 722,
798
+ "sqcap": 723,
799
+ "Re": 724,
800
+ "varsigma": 725,
801
+ "arp": 726,
802
+ "cong": 727,
803
+ "eil": 728,
804
+ "downarrow": 729,
805
+ "lef": 730,
806
+ "overset": 731,
807
+ "circle": 732,
808
+ "det": 733,
809
+ "leftarrow": 734,
810
+ "ss": 735,
811
+ "emp": 736,
812
+ "ty": 737,
813
+ "empty": 738,
814
+ "emptyset": 739,
815
+ "no": 740,
816
+ "scriptstyle": 741,
817
+ "circled": 742,
818
+ "ara": 743,
819
+ "para": 744,
820
+ "llel": 745,
821
+ "parallel": 746,
822
+ "textcircled": 747,
823
+ "bre": 748,
824
+ "Ġ--": 749,
825
+ "Im": 750,
826
+ "leqslant": 751,
827
+ "breve": 752,
828
+ "ack": 753,
829
+ "back": 754,
830
+ "varPhi": 755,
831
+ "backslash": 756,
832
+ "gg": 757,
833
+ "textup": 758,
834
+ "less": 759,
835
+ "box": 760,
836
+ "ne": 761,
837
+ "varDelta": 762,
838
+ "sharp": 763,
839
+ "varGamma": 764,
840
+ "ze": 765,
841
+ "lesssim": 766,
842
+ "size": 767,
843
+ "mal": 768,
844
+ "nto": 769,
845
+ "phanto": 770,
846
+ "phantom": 771,
847
+ "varTheta": 772,
848
+ "ut": 773,
849
+ "varPsi": 774,
850
+ "nor": 775,
851
+ "normal": 776,
852
+ "varUpsilon": 777,
853
+ "varOmega": 778,
854
+ "mbe": 779,
855
+ "numbe": 780,
856
+ "number": 781,
857
+ "nonumber": 782,
858
+ "atop": 783,
859
+ "bigcap": 784,
860
+ "mod": 785,
861
+ "wp": 786,
862
+ "tt": 787,
863
+ "varLambda": 788,
864
+ "xist": 789,
865
+ "xists": 790,
866
+ "ct": 791,
867
+ "exists": 792,
868
+ "varXi": 793,
869
+ "Lo": 794,
870
+ "varSigma": 795,
871
+ "textnormal": 796,
872
+ "pmod": 797,
873
+ "Long": 798,
874
+ "varPi": 799,
875
+ "ect": 800,
876
+ "jmath": 801,
877
+ "rvert": 802,
878
+ "lvert": 803,
879
+ "ing": 804,
880
+ "rbrace": 805,
881
+ "Biggr": 806,
882
+ "sma": 807,
883
+ "Biggl": 808,
884
+ "ject": 809,
885
+ "bject": 810,
886
+ "acut": 811,
887
+ "acute": 812,
888
+ "scriptsize": 813,
889
+ "rceil": 814,
890
+ "small": 815,
891
+ "lbrack": 816,
892
+ "ceil": 817,
893
+ "lceil": 818,
894
+ "tin": 819,
895
+ "tiny": 820,
896
+ "scriptscriptstyle": 821,
897
+ "odot": 822,
898
+ "iint": 823,
899
+ "Le": 824,
900
+ "cot": 825,
901
+ "Left": 826,
902
+ "Leftrightarrow": 827,
903
+ "arc": 828,
904
+ "mathr": 829,
905
+ "cfrac": 830,
906
+ "ref": 831,
907
+ "mathring": 832,
908
+ "ker": 833,
909
+ "pre": 834,
910
+ "en": 835,
911
+ "mathit": 836,
912
+ "bmod": 837,
913
+ "ond": 838,
914
+ "overleftarrow": 839,
915
+ "hline": 840,
916
+ "Longrightarrow": 841,
917
+ "pmb": 842,
918
+ "amond": 843,
919
+ "kern": 844,
920
+ "sk": 845,
921
+ "geqslant": 846,
922
+ "diamond": 847,
923
+ "ip": 848,
924
+ "skip": 849,
925
+ "arctan": 850,
926
+ "xrightarrow": 851,
927
+ "mathtt": 852,
928
+ "lbrace": 853,
929
+ "eqq": 854,
930
+ "lar": 855,
931
+ "prec": 856,
932
+ "coloneqq": 857,
933
+ "lVert": 858,
934
+ "fla": 859,
935
+ "flat": 860,
936
+ "rVert": 861,
937
+ "arpo": 862,
938
+ "bigwedge": 863,
939
+ "dotsc": 864,
940
+ "gra": 865,
941
+ "grave": 866,
942
+ "object": 867,
943
+ "land": 868,
944
+ "ĠObject": 869,
945
+ "eph": 870,
946
+ "aleph": 871,
947
+ "underrightarrow": 872,
948
+ "overleftrightarrow": 873,
949
+ "Longleftrightarrow": 874,
950
+ "rbrack": 875,
951
+ "arpoo": 876,
952
+ "tanh": 877,
953
+ "underleftarrow": 878,
954
+ "tit": 879,
955
+ "textit": 880,
956
+ "enspace": 881,
957
+ "Bo": 882,
958
+ "underleftrightarrow": 883,
959
+ "Box": 884,
960
+ "ni": 885,
961
+ "squ": 886,
962
+ "triangleq": 887,
963
+ "mathop": 888,
964
+ "ff": 889,
965
+ "iff": 890,
966
+ "bigtriangle": 891,
967
+ "mit": 892,
968
+ "bigotimes": 893,
969
+ "neg": 894,
970
+ "eqref": 895,
971
+ "are": 896,
972
+ "square": 897,
973
+ "coth": 898,
974
+ "\\,": 899,
975
+ "dash": 900,
976
+ "boxtimes": 901,
977
+ "hdots": 902,
978
+ "tharpoo": 903,
979
+ "underbar": 904,
980
+ "dim": 905,
981
+ "textsf": 906,
982
+ "cc": 907,
983
+ "varkappa": 908,
984
+ "ere": 909,
985
+ "there": 910,
986
+ "fore": 911,
987
+ "therefore": 912,
988
+ "imp": 913,
989
+ "impli": 914,
990
+ "es": 915,
991
+ "implies": 916,
992
+ "Ġ\\%": 917,
993
+ "sb": 918,
994
+ "longleftrightarrow": 919,
995
+ "succ": 920,
996
+ "fbox": 921,
997
+ "otnot": 922,
998
+ "footnot": 923,
999
+ "footnote": 924,
1000
+ "ominus": 925,
1001
+ "bigtriangleup": 926,
1002
+ "doteq": 927,
1003
+ "em": 928,
1004
+ "Ġ\\_": 929,
1005
+ "vphantom": 930,
1006
+ "jlim": 931,
1007
+ "ok": 932,
1008
+ "hook": 933,
1009
+ "hd": 934,
1010
+ "longmapsto": 935,
1011
+ "cro": 936,
1012
+ "omi": 937,
1013
+ "cron": 938,
1014
+ "omicron": 939,
1015
+ "bigsqcup": 940,
1016
+ "Ġ?": 941,
1017
+ "nup": 942,
1018
+ "tharpoonup": 943,
1019
+ "texttt": 944,
1020
+ "footnotesize": 945,
1021
+ "vdash": 946,
1022
+ "hookrightarrow": 947,
1023
+ "textsc": 948,
1024
+ "\\#": 949,
1025
+ "rightharpoonup": 950,
1026
+ "bigvee": 951,
1027
+ "large": 952,
1028
+ "tabu": 953,
1029
+ "tabular": 954,
1030
+ "lap": 955,
1031
+ "supseteq": 956,
1032
+ "emph": 957,
1033
+ "Ti": 958,
1034
+ "Tilde": 959,
1035
+ "Ġ\"": 960,
1036
+ "thing": 961,
1037
+ "varno": 962,
1038
+ "varnothing": 963,
1039
+ "sec": 964,
1040
+ "rtimes": 965,
1041
+ "Ġ`": 966,
1042
+ "ral": 967,
1043
+ "triangleright": 968,
1044
+ "\\;": 969,
1045
+ "min": 970,
1046
+ "dotsb": 971,
1047
+ "searrow": 972,
1048
+ "mkern": 973,
1049
+ "plus": 974,
1050
+ "med": 975,
1051
+ "boxplus": 976,
1052
+ "ãĢ": 977,
1053
+ "Lar": 978,
1054
+ "Large": 979,
1055
+ "tu": 980,
1056
+ "rai": 981,
1057
+ "natu": 982,
1058
+ "arcsin": 983,
1059
+ "natural": 984,
1060
+ "raise": 985,
1061
+ "nmid": 986,
1062
+ "rhd": 987,
1063
+ "\\:": 988,
1064
+ "triangleleft": 989,
1065
+ "medskip": 990,
1066
+ "csc": 991,
1067
+ "dle": 992,
1068
+ "middle": 993,
1069
+ "ãĢģ": 994,
1070
+ "hf": 995,
1071
+ "enskip": 996,
1072
+ "deg": 997,
1073
+ "cdotp": 998,
1074
+ "ts": 999,
1075
+ "gets": 1000,
1076
+ "boxed": 1001,
1077
+ "preceq": 1002,
1078
+ "asym": 1003,
1079
+ "asymp": 1004,
1080
+ "on": 1005,
1081
+ "arg": 1006,
1082
+ "AA": 1007,
1083
+ "ner": 1008,
1084
+ "Ġ\\&": 1009,
1085
+ "inter": 1010,
1086
+ "intercal": 1011,
1087
+ "cop": 1012,
1088
+ "projlim": 1013,
1089
+ "ou": 1014,
1090
+ "ĠãĢģ": 1015,
1091
+ "subsetneq": 1016,
1092
+ "rod": 1017,
1093
+ "coprod": 1018,
1094
+ "pou": 1019,
1095
+ "smi": 1020,
1096
+ "varprojlim": 1021,
1097
+ "nds": 1022,
1098
+ "pounds": 1023,
1099
+ "smile": 1024,
1100
+ "drel": 1025,
1101
+ "buil": 1026,
1102
+ "buildrel": 1027,
1103
+ "nearrow": 1028,
1104
+ "smallsetminus": 1029,
1105
+ "Ġ\\/": 1030,
1106
+ "injlim": 1031,
1107
+ "succeq": 1032,
1108
+ "low": 1033,
1109
+ "ill": 1034,
1110
+ "put": 1035,
1111
+ "otect": 1036,
1112
+ "protect": 1037,
1113
+ "varinjlim": 1038,
1114
+ "prescript": 1039,
1115
+ "rut": 1040,
1116
+ "strut": 1041,
1117
+ "lnot": 1042,
1118
+ "iiint": 1043,
1119
+ "nfrac": 1044,
1120
+ "rlap": 1045,
1121
+ "ver": 1046,
1122
+ "genfrac": 1047,
1123
+ "\\!": 1048,
1124
+ "intop": 1049,
1125
+ "suit": 1050,
1126
+ "gt": 1051,
1127
+ "sl": 1052,
1128
+ "hfill": 1053,
1129
+ "mathver": 1054,
1130
+ "sion": 1055,
1131
+ "dotsm": 1056,
1132
+ "mathversion": 1057,
1133
+ "ddagger": 1058,
1134
+ "gtr": 1059,
1135
+ "max": 1060,
1136
+ "ke": 1061,
1137
+ "make": 1062,
1138
+ "makebox": 1063,
1139
+ "pe": 1064,
1140
+ "use": 1065,
1141
+ "beca": 1066,
1142
+ "because": 1067,
1143
+ "Bar": 1068,
1144
+ "ak": 1069,
1145
+ "noalign": 1070,
1146
+ "break": 1071,
1147
+ "leng": 1072,
1148
+ "length": 1073,
1149
+ "tch": 1074,
1150
+ "ule": 1075,
1151
+ "bigtriangledown": 1076,
1152
+ "frow": 1077,
1153
+ "lhd": 1078,
1154
+ "frown": 1079,
1155
+ "ipsi": 1080,
1156
+ "vspace": 1081,
1157
+ "mathell": 1082,
1158
+ "rcor": 1083,
1159
+ "real": 1084,
1160
+ "ipsis": 1085,
1161
+ "mathellipsis": 1086,
1162
+ "rcorner": 1087,
1163
+ "mo": 1088,
1164
+ "Ġâ": 1089,
1165
+ "null": 1090,
1166
+ "mode": 1091,
1167
+ "ĠâĢ": 1092,
1168
+ "ls": 1093,
1169
+ "lrcorner": 1094,
1170
+ "mspace": 1095,
1171
+ "gtrsim": 1096,
1172
+ "models": 1097,
1173
+ "smallskip": 1098,
1174
+ "vline": 1099,
1175
+ "Ġ\\*": 1100,
1176
+ "col": 1101,
1177
+ "Bbb": 1102,
1178
+ "arpoon": 1103,
1179
+ "Bbbk": 1104,
1180
+ "allow": 1105,
1181
+ "rule": 1106,
1182
+ "uph": 1107,
1183
+ "upharpoon": 1108,
1184
+ "hslash": 1109,
1185
+ "llap": 1110,
1186
+ "ĠâĢIJ": 1111,
1187
+ "upharpoonright": 1112,
1188
+ "deset": 1113,
1189
+ "sideset": 1114,
1190
+ "allowbreak": 1115,
1191
+ "ltimes": 1116,
1192
+ "labe": 1117,
1193
+ "bigm": 1118,
1194
+ "raisebox": 1119,
1195
+ "label": 1120,
1196
+ "arrayst": 1121,
1197
+ "retch": 1122,
1198
+ "arraystretch": 1123,
1199
+ "bigstar": 1124,
1200
+ "sep": 1125,
1201
+ "smash": 1126,
1202
+ "diamondsuit": 1127,
1203
+ "sha": 1128,
1204
+ "shape": 1129,
1205
+ "colsep": 1130,
1206
+ "ns": 1131,
1207
+ "tharpoons": 1132,
1208
+ "Ġ\\-": 1133,
1209
+ "new": 1134,
1210
+ "ce": 1135,
1211
+ "normalsize": 1136,
1212
+ "arraycolsep": 1137,
1213
+ "thin": 1138,
1214
+ "rightlef": 1139,
1215
+ "rightleftharpoons": 1140,
1216
+ "SS": 1141,
1217
+ "he": 1142,
1218
+ "dast": 1143,
1219
+ "setlength": 1144,
1220
+ "circledast": 1145,
1221
+ "cent": 1146,
1222
+ "ure": 1147,
1223
+ "pict": 1148,
1224
+ "picture": 1149,
1225
+ "thinspace": 1150,
1226
+ "ddag": 1151,
1227
+ "hphantom": 1152,
1228
+ "mma": 1153,
1229
+ "rd": 1154,
1230
+ "surd": 1155,
1231
+ "comma": 1156,
1232
+ "command": 1157,
1233
+ "uplus": 1158,
1234
+ "newcommand": 1159,
1235
+ "renewcommand": 1160,
1236
+ "Vec": 1161,
1237
+ "limit": 1162,
1238
+ "Ġ---": 1163,
1239
+ "Vdash": 1164,
1240
+ "eth": 1165,
1241
+ "longleftarrow": 1166,
1242
+ "cu": 1167,
1243
+ "leqq": 1168,
1244
+ "cur": 1169,
1245
+ "oslash": 1170,
1246
+ "sure": 1171,
1247
+ "mathc": 1172,
1248
+ "bigcirc": 1173,
1249
+ "ensure": 1174,
1250
+ "ensuremath": 1175,
1251
+ "xleftarrow": 1176,
1252
+ "varlim": 1177,
1253
+ "inte": 1178,
1254
+ "triangledown": 1179,
1255
+ "rner": 1180,
1256
+ "sho": 1181,
1257
+ "mathor": 1182,
1258
+ "corner": 1183,
1259
+ "short": 1184,
1260
+ "mathord": 1185,
1261
+ "Ġcm": 1186,
1262
+ "rtex": 1187,
1263
+ "intertex": 1188,
1264
+ "intertext": 1189,
1265
+ "hfil": 1190,
1266
+ "an": 1191,
1267
+ "bigskip": 1192,
1268
+ "wr": 1193,
1269
+ "vartriangle": 1194,
1270
+ "warrow": 1195,
1271
+ "updownarrow": 1196,
1272
+ "ae": 1197,
1273
+ "mathrel": 1198,
1274
+ "endaligned": 1199
1275
+ },
1276
+ "merges": [
1277
+ "Ġ }",
1278
+ "Ġ {",
1279
+ "Ġ \\",
1280
+ "Ġ _",
1281
+ "Ġ ^",
1282
+ "Ġ )",
1283
+ "Ġ (",
1284
+ "Ġ 1",
1285
+ "Ġ 2",
1286
+ "Ġ =",
1287
+ "Ġ ,",
1288
+ "Ġ -",
1289
+ "r a",
1290
+ "Ġ +",
1291
+ "m a",
1292
+ "ra c",
1293
+ "f rac",
1294
+ "t a",
1295
+ "Ġ 0",
1296
+ "i g",
1297
+ "Ġ i",
1298
+ "Ġ x",
1299
+ "a l",
1300
+ "Ġ n",
1301
+ "l e",
1302
+ "a r",
1303
+ "m e",
1304
+ "t h",
1305
+ "i n",
1306
+ "Ġ .",
1307
+ "f t",
1308
+ "p h",
1309
+ "Ġ t",
1310
+ "Ġ k",
1311
+ "ma th",
1312
+ "Ġ d",
1313
+ "ig h",
1314
+ "r igh",
1315
+ "a m",
1316
+ "Ġ a",
1317
+ "t i",
1318
+ "e ta",
1319
+ "le ft",
1320
+ "righ t",
1321
+ "Ġ m",
1322
+ "Ġ j",
1323
+ "Ġ\\ ,",
1324
+ "Ġ p",
1325
+ "d e",
1326
+ "Ġ s",
1327
+ "Ġ r",
1328
+ "Ġ 3",
1329
+ "Ġ e",
1330
+ "s i",
1331
+ "Ġ |",
1332
+ "r i",
1333
+ "m u",
1334
+ "Ġ c",
1335
+ "Ġ &",
1336
+ "ph a",
1337
+ "al pha",
1338
+ "d a",
1339
+ "Ġ l",
1340
+ "o t",
1341
+ "d ot",
1342
+ "b da",
1343
+ "am bda",
1344
+ "Ġ u",
1345
+ "Ġ f",
1346
+ "Ġ g",
1347
+ "Ġ A",
1348
+ "Ġ z",
1349
+ "Ġ 4",
1350
+ "p si",
1351
+ "p ri",
1352
+ "pri me",
1353
+ "Ġ\\ \\",
1354
+ "Ġ y",
1355
+ "p ar",
1356
+ "ti al",
1357
+ "par tial",
1358
+ "t o",
1359
+ "ph i",
1360
+ "r m",
1361
+ "Ġ b",
1362
+ "math rm",
1363
+ "n a",
1364
+ "l ta",
1365
+ "Ġ q",
1366
+ "s u",
1367
+ "am ma",
1368
+ "l ambda",
1369
+ "c al",
1370
+ "Ġ N",
1371
+ "Ġ ~",
1372
+ "p i",
1373
+ "su m",
1374
+ "o p",
1375
+ "Ġ T",
1376
+ "n u",
1377
+ "Ġ ]",
1378
+ "l o",
1379
+ "v ar",
1380
+ "op e",
1381
+ "to r",
1382
+ "ra tor",
1383
+ "ope rator",
1384
+ "na me",
1385
+ "operator name",
1386
+ "in t",
1387
+ "ig ma",
1388
+ "Ġ v",
1389
+ "g a",
1390
+ "me ga",
1391
+ "Ġ B",
1392
+ "Ġ R",
1393
+ "b eta",
1394
+ "r t",
1395
+ "Ġ h",
1396
+ "Ġ L",
1397
+ "v e",
1398
+ "Ġ [",
1399
+ "Ġ /",
1400
+ "lo n",
1401
+ "b f",
1402
+ "Ġ M",
1403
+ "th eta",
1404
+ "de lta",
1405
+ "Ġ D",
1406
+ "Ġ X",
1407
+ "n g",
1408
+ "Ġ\\ ;",
1409
+ "psi lon",
1410
+ "b ar",
1411
+ "s igma",
1412
+ "g amma",
1413
+ "ng le",
1414
+ "Ġ C",
1415
+ "Ġ F",
1416
+ "Ġ S",
1417
+ "l de",
1418
+ "ti lde",
1419
+ "c dot",
1420
+ "e psilon",
1421
+ "Ġ H",
1422
+ "s q",
1423
+ "Ġ P",
1424
+ "h i",
1425
+ "h o",
1426
+ "r ho",
1427
+ "ra y",
1428
+ "ar ray",
1429
+ "l a",
1430
+ "h a",
1431
+ "ha t",
1432
+ "b e",
1433
+ "n d",
1434
+ "sq rt",
1435
+ "Ġ o",
1436
+ "e nd",
1437
+ "g in",
1438
+ "be gin",
1439
+ "q u",
1440
+ "ta u",
1441
+ "in ft",
1442
+ "inft y",
1443
+ "Ġ E",
1444
+ "x i",
1445
+ "Ġ V",
1446
+ "Ġ I",
1447
+ "Ġ 5",
1448
+ "o mega",
1449
+ "Ġ G",
1450
+ "Ġ *",
1451
+ "Ġ w",
1452
+ "math cal",
1453
+ "b ig",
1454
+ "r o",
1455
+ "g e",
1456
+ "Ġ :",
1457
+ "me s",
1458
+ "ti mes",
1459
+ "le q",
1460
+ "a d",
1461
+ "Ġ Q",
1462
+ "qu ad",
1463
+ "Ġ K",
1464
+ "math bf",
1465
+ "dot s",
1466
+ "Ġ 6",
1467
+ "D e",
1468
+ "De lta",
1469
+ "a p",
1470
+ "var phi",
1471
+ "Ġ\\ }",
1472
+ "l l",
1473
+ "Ġ J",
1474
+ "Ġ U",
1475
+ "G amma",
1476
+ "ra ngle",
1477
+ "P hi",
1478
+ "l in",
1479
+ "lin e",
1480
+ "t ri",
1481
+ "r line",
1482
+ "Ġ\\ {",
1483
+ "Ġ ;",
1484
+ "Ġ W",
1485
+ "O mega",
1486
+ "Ġ Z",
1487
+ "ma tri",
1488
+ "matri x",
1489
+ "o ve",
1490
+ "Ġ 8",
1491
+ "Ġ\\ |",
1492
+ "ap p",
1493
+ "\\ {",
1494
+ "s e",
1495
+ "e ll",
1496
+ "ve c",
1497
+ "B ig",
1498
+ "L ambda",
1499
+ "p m",
1500
+ "b b",
1501
+ "math bb",
1502
+ "c hi",
1503
+ "Ġ <",
1504
+ "Ġ Y",
1505
+ "b la",
1506
+ "na bla",
1507
+ "la ngle",
1508
+ "ta r",
1509
+ "ove rline",
1510
+ "var epsilon",
1511
+ "Ġ O",
1512
+ "ro w",
1513
+ "Ġ >",
1514
+ "w i",
1515
+ "wi de",
1516
+ "e d",
1517
+ "app a",
1518
+ "k appa",
1519
+ "cdot s",
1520
+ "tar row",
1521
+ "l dots",
1522
+ "P si",
1523
+ "z eta",
1524
+ "Ġ\\ !",
1525
+ "ge q",
1526
+ "Ġ 7",
1527
+ "e x",
1528
+ "righ tarrow",
1529
+ "i v",
1530
+ "o times",
1531
+ "b o",
1532
+ "Ġ 9",
1533
+ "bo l",
1534
+ "s t",
1535
+ "q quad",
1536
+ "p ro",
1537
+ "r c",
1538
+ "pro d",
1539
+ "e qu",
1540
+ "equ iv",
1541
+ "se t",
1542
+ "c i",
1543
+ "ci rc",
1544
+ "ve rt",
1545
+ "wide tilde",
1546
+ "c a",
1547
+ "se s",
1548
+ "ca ses",
1549
+ "S igma",
1550
+ "Ġ !",
1551
+ "da g",
1552
+ "c o",
1553
+ "t ex",
1554
+ "w ed",
1555
+ "wed ge",
1556
+ "\\ }",
1557
+ "big g",
1558
+ "tex t",
1559
+ "u n",
1560
+ "s in",
1561
+ "un de",
1562
+ "si m",
1563
+ "ge r",
1564
+ "dag ger",
1565
+ "e q",
1566
+ "ig n",
1567
+ "al ign",
1568
+ "align ed",
1569
+ "u s",
1570
+ "P i",
1571
+ "co s",
1572
+ "u p",
1573
+ "wide hat",
1574
+ "c ap",
1575
+ "m i",
1576
+ "a st",
1577
+ "T h",
1578
+ "Th eta",
1579
+ "Ġ\\ :",
1580
+ "bol d",
1581
+ "c k",
1582
+ "n eq",
1583
+ "mi d",
1584
+ "su b",
1585
+ "c up",
1586
+ "o m",
1587
+ "b rac",
1588
+ "b in",
1589
+ "bin om",
1590
+ "f ra",
1591
+ "math fra",
1592
+ "mathfra k",
1593
+ "l n",
1594
+ "l us",
1595
+ "s y",
1596
+ "sy m",
1597
+ "h bar",
1598
+ "sym bol",
1599
+ "bold symbol",
1600
+ "op lus",
1601
+ "s p",
1602
+ "a ngle",
1603
+ "unde rline",
1604
+ "ex p",
1605
+ "f o",
1606
+ "ro x",
1607
+ "app rox",
1608
+ "r e",
1609
+ "ra ll",
1610
+ "fo rall",
1611
+ "sub set",
1612
+ "\\ |",
1613
+ "e r",
1614
+ "y le",
1615
+ "st yle",
1616
+ "brac e",
1617
+ "V e",
1618
+ "s ta",
1619
+ "sta ck",
1620
+ "Ve rt",
1621
+ "s tar",
1622
+ "p er",
1623
+ "per p",
1624
+ "i o",
1625
+ "io ta",
1626
+ "unde r",
1627
+ "ove r",
1628
+ "var theta",
1629
+ "o r",
1630
+ "big l",
1631
+ "big r",
1632
+ "text rm",
1633
+ "d i",
1634
+ "n ot",
1635
+ "s f",
1636
+ "m p",
1637
+ "a c",
1638
+ "X i",
1639
+ "lo g",
1640
+ "Big r",
1641
+ "re l",
1642
+ "me q",
1643
+ "si meq",
1644
+ "Big l",
1645
+ "stack rel",
1646
+ "f lo",
1647
+ "flo or",
1648
+ "la y",
1649
+ "sp lay",
1650
+ "di splay",
1651
+ "display style",
1652
+ "sp ac",
1653
+ "spac e",
1654
+ "Big g",
1655
+ "in us",
1656
+ "var rho",
1657
+ "v dots",
1658
+ "lon g",
1659
+ "Ġ '",
1660
+ "m inus",
1661
+ "text bf",
1662
+ "set minus",
1663
+ "bigg r",
1664
+ "bigg l",
1665
+ "d iv",
1666
+ "math sf",
1667
+ "U psilon",
1668
+ "s c",
1669
+ "s la",
1670
+ "h space",
1671
+ "l le",
1672
+ "s h",
1673
+ "ta n",
1674
+ "long rightarrow",
1675
+ "tri angle",
1676
+ "p t",
1677
+ "ri pt",
1678
+ "sc ript",
1679
+ "d dot",
1680
+ "under brace",
1681
+ "sin h",
1682
+ "brac k",
1683
+ "s to",
1684
+ "p sto",
1685
+ "ma psto",
1686
+ "c h",
1687
+ "to p",
1688
+ "sub stack",
1689
+ "left rightarrow",
1690
+ "e ck",
1691
+ "ch eck",
1692
+ "cos h",
1693
+ "big cup",
1694
+ "lo r",
1695
+ "su p",
1696
+ "sup set",
1697
+ "o int",
1698
+ "b u",
1699
+ "var pi",
1700
+ "r floor",
1701
+ "l i",
1702
+ "not in",
1703
+ "u psilon",
1704
+ "l floor",
1705
+ "sq cup",
1706
+ "Ġ\\ #",
1707
+ "ve e",
1708
+ "subset eq",
1709
+ "bold math",
1710
+ "o d",
1711
+ "par row",
1712
+ "text style",
1713
+ "p r",
1714
+ "ar row",
1715
+ "d dots",
1716
+ "co lon",
1717
+ "u parrow",
1718
+ "big oplus",
1719
+ "lle t",
1720
+ "bu llet",
1721
+ "li m",
1722
+ "n t",
1723
+ "op to",
1724
+ "pr opto",
1725
+ "over rightarrow",
1726
+ "sla nt",
1727
+ "over brace",
1728
+ "i t",
1729
+ "R igh",
1730
+ "Righ tarrow",
1731
+ "o w",
1732
+ "ow n",
1733
+ "i l",
1734
+ "b ot",
1735
+ "sla sh",
1736
+ "under set",
1737
+ "d own",
1738
+ "i math",
1739
+ "sq cap",
1740
+ "R e",
1741
+ "var sigma",
1742
+ "ar p",
1743
+ "co ng",
1744
+ "e il",
1745
+ "down arrow",
1746
+ "le f",
1747
+ "over set",
1748
+ "circ le",
1749
+ "de t",
1750
+ "lef tarrow",
1751
+ "s s",
1752
+ "e mp",
1753
+ "t y",
1754
+ "emp ty",
1755
+ "empty set",
1756
+ "n o",
1757
+ "script style",
1758
+ "circle d",
1759
+ "a ra",
1760
+ "p ara",
1761
+ "lle l",
1762
+ "para llel",
1763
+ "text circled",
1764
+ "b re",
1765
+ "Ġ- -",
1766
+ "I m",
1767
+ "leq slant",
1768
+ "bre ve",
1769
+ "a ck",
1770
+ "b ack",
1771
+ "var Phi",
1772
+ "back slash",
1773
+ "g g",
1774
+ "text up",
1775
+ "le ss",
1776
+ "bo x",
1777
+ "n e",
1778
+ "var Delta",
1779
+ "sh arp",
1780
+ "var Gamma",
1781
+ "z e",
1782
+ "less sim",
1783
+ "si ze",
1784
+ "ma l",
1785
+ "n to",
1786
+ "pha nto",
1787
+ "phanto m",
1788
+ "var Theta",
1789
+ "u t",
1790
+ "var Psi",
1791
+ "n or",
1792
+ "nor mal",
1793
+ "var Upsilon",
1794
+ "var Omega",
1795
+ "m be",
1796
+ "nu mbe",
1797
+ "numbe r",
1798
+ "no number",
1799
+ "a top",
1800
+ "big cap",
1801
+ "m od",
1802
+ "w p",
1803
+ "t t",
1804
+ "var Lambda",
1805
+ "xi st",
1806
+ "xist s",
1807
+ "c t",
1808
+ "e xists",
1809
+ "var Xi",
1810
+ "L o",
1811
+ "var Sigma",
1812
+ "text normal",
1813
+ "pm od",
1814
+ "Lo ng",
1815
+ "var Pi",
1816
+ "e ct",
1817
+ "j math",
1818
+ "r vert",
1819
+ "l vert",
1820
+ "in g",
1821
+ "r brace",
1822
+ "Bigg r",
1823
+ "s ma",
1824
+ "Bigg l",
1825
+ "j ect",
1826
+ "b ject",
1827
+ "ac ut",
1828
+ "acut e",
1829
+ "script size",
1830
+ "rc eil",
1831
+ "sma ll",
1832
+ "l brack",
1833
+ "c eil",
1834
+ "l ceil",
1835
+ "t in",
1836
+ "tin y",
1837
+ "script scriptstyle",
1838
+ "o dot",
1839
+ "i int",
1840
+ "L e",
1841
+ "c ot",
1842
+ "Le ft",
1843
+ "Left rightarrow",
1844
+ "ar c",
1845
+ "math r",
1846
+ "c frac",
1847
+ "re f",
1848
+ "mathr ing",
1849
+ "k er",
1850
+ "p re",
1851
+ "e n",
1852
+ "math it",
1853
+ "b mod",
1854
+ "o nd",
1855
+ "over leftarrow",
1856
+ "h line",
1857
+ "Long rightarrow",
1858
+ "pm b",
1859
+ "am ond",
1860
+ "ker n",
1861
+ "s k",
1862
+ "geq slant",
1863
+ "di amond",
1864
+ "i p",
1865
+ "sk ip",
1866
+ "arc tan",
1867
+ "x rightarrow",
1868
+ "math tt",
1869
+ "l brace",
1870
+ "eq q",
1871
+ "l ar",
1872
+ "pre c",
1873
+ "colon eqq",
1874
+ "l Vert",
1875
+ "f la",
1876
+ "fla t",
1877
+ "r Vert",
1878
+ "arp o",
1879
+ "big wedge",
1880
+ "dots c",
1881
+ "g ra",
1882
+ "gra ve",
1883
+ "o bject",
1884
+ "la nd",
1885
+ "ĠO bject",
1886
+ "e ph",
1887
+ "al eph",
1888
+ "under rightarrow",
1889
+ "over leftrightarrow",
1890
+ "Long leftrightarrow",
1891
+ "r brack",
1892
+ "arpo o",
1893
+ "tan h",
1894
+ "under leftarrow",
1895
+ "ti t",
1896
+ "tex tit",
1897
+ "en space",
1898
+ "B o",
1899
+ "under leftrightarrow",
1900
+ "Bo x",
1901
+ "n i",
1902
+ "sq u",
1903
+ "triangle q",
1904
+ "math op",
1905
+ "f f",
1906
+ "i ff",
1907
+ "big triangle",
1908
+ "mi t",
1909
+ "big otimes",
1910
+ "ne g",
1911
+ "eq ref",
1912
+ "ar e",
1913
+ "squ are",
1914
+ "co th",
1915
+ "\\ ,",
1916
+ "da sh",
1917
+ "box times",
1918
+ "h dots",
1919
+ "th arpoo",
1920
+ "under bar",
1921
+ "di m",
1922
+ "text sf",
1923
+ "c c",
1924
+ "var kappa",
1925
+ "e re",
1926
+ "th ere",
1927
+ "fo re",
1928
+ "there fore",
1929
+ "i mp",
1930
+ "imp li",
1931
+ "e s",
1932
+ "impli es",
1933
+ "Ġ\\ %",
1934
+ "s b",
1935
+ "long leftrightarrow",
1936
+ "su cc",
1937
+ "f box",
1938
+ "ot not",
1939
+ "fo otnot",
1940
+ "footnot e",
1941
+ "om inus",
1942
+ "bigtriangle up",
1943
+ "dot eq",
1944
+ "e m",
1945
+ "Ġ\\ _",
1946
+ "v phantom",
1947
+ "j lim",
1948
+ "o k",
1949
+ "ho ok",
1950
+ "h d",
1951
+ "long mapsto",
1952
+ "c ro",
1953
+ "o mi",
1954
+ "cro n",
1955
+ "omi cron",
1956
+ "big sqcup",
1957
+ "Ġ ?",
1958
+ "nu p",
1959
+ "tharpoo nup",
1960
+ "text tt",
1961
+ "footnote size",
1962
+ "v dash",
1963
+ "hook rightarrow",
1964
+ "text sc",
1965
+ "\\ #",
1966
+ "righ tharpoonup",
1967
+ "big vee",
1968
+ "lar ge",
1969
+ "ta bu",
1970
+ "tabu lar",
1971
+ "la p",
1972
+ "supset eq",
1973
+ "em ph",
1974
+ "T i",
1975
+ "Ti lde",
1976
+ "Ġ \"",
1977
+ "th ing",
1978
+ "var no",
1979
+ "varno thing",
1980
+ "se c",
1981
+ "r times",
1982
+ "Ġ `",
1983
+ "ra l",
1984
+ "triangle right",
1985
+ "\\ ;",
1986
+ "m in",
1987
+ "dots b",
1988
+ "se arrow",
1989
+ "m kern",
1990
+ "p lus",
1991
+ "me d",
1992
+ "box plus",
1993
+ "ã Ģ",
1994
+ "L ar",
1995
+ "Lar ge",
1996
+ "t u",
1997
+ "ra i",
1998
+ "na tu",
1999
+ "arc sin",
2000
+ "natu ral",
2001
+ "rai se",
2002
+ "n mid",
2003
+ "r hd",
2004
+ "\\ :",
2005
+ "triangle left",
2006
+ "med skip",
2007
+ "c sc",
2008
+ "d le",
2009
+ "mid dle",
2010
+ "ãĢ ģ",
2011
+ "h f",
2012
+ "en skip",
2013
+ "de g",
2014
+ "cdot p",
2015
+ "t s",
2016
+ "ge ts",
2017
+ "box ed",
2018
+ "prec eq",
2019
+ "a sym",
2020
+ "asym p",
2021
+ "o n",
2022
+ "ar g",
2023
+ "A A",
2024
+ "n er",
2025
+ "Ġ\\ &",
2026
+ "int er",
2027
+ "inter cal",
2028
+ "c op",
2029
+ "pro jlim",
2030
+ "o u",
2031
+ "Ġ ãĢģ",
2032
+ "subset neq",
2033
+ "ro d",
2034
+ "cop rod",
2035
+ "p ou",
2036
+ "s mi",
2037
+ "var projlim",
2038
+ "nd s",
2039
+ "pou nds",
2040
+ "smi le",
2041
+ "d rel",
2042
+ "bu il",
2043
+ "buil drel",
2044
+ "ne arrow",
2045
+ "small setminus",
2046
+ "Ġ\\ /",
2047
+ "in jlim",
2048
+ "succ eq",
2049
+ "lo w",
2050
+ "i ll",
2051
+ "p ut",
2052
+ "ot ect",
2053
+ "pr otect",
2054
+ "var injlim",
2055
+ "pre script",
2056
+ "r ut",
2057
+ "st rut",
2058
+ "ln ot",
2059
+ "i iint",
2060
+ "n frac",
2061
+ "r lap",
2062
+ "ve r",
2063
+ "ge nfrac",
2064
+ "\\ !",
2065
+ "in top",
2066
+ "su it",
2067
+ "g t",
2068
+ "s l",
2069
+ "hf ill",
2070
+ "math ver",
2071
+ "si on",
2072
+ "dots m",
2073
+ "mathver sion",
2074
+ "d dagger",
2075
+ "gt r",
2076
+ "ma x",
2077
+ "k e",
2078
+ "ma ke",
2079
+ "make box",
2080
+ "p e",
2081
+ "u se",
2082
+ "be ca",
2083
+ "beca use",
2084
+ "B ar",
2085
+ "a k",
2086
+ "no align",
2087
+ "bre ak",
2088
+ "le ng",
2089
+ "leng th",
2090
+ "t ch",
2091
+ "u le",
2092
+ "bigtriangle down",
2093
+ "f row",
2094
+ "l hd",
2095
+ "frow n",
2096
+ "i psi",
2097
+ "v space",
2098
+ "math ell",
2099
+ "rc or",
2100
+ "re al",
2101
+ "ipsi s",
2102
+ "mathell ipsis",
2103
+ "rcor ner",
2104
+ "m o",
2105
+ "Ġ â",
2106
+ "nu ll",
2107
+ "mo de",
2108
+ "Ġâ Ģ",
2109
+ "l s",
2110
+ "l rcorner",
2111
+ "m space",
2112
+ "gtr sim",
2113
+ "mode ls",
2114
+ "small skip",
2115
+ "v line",
2116
+ "Ġ\\ *",
2117
+ "co l",
2118
+ "B bb",
2119
+ "arpoo n",
2120
+ "Bbb k",
2121
+ "al low",
2122
+ "r ule",
2123
+ "u ph",
2124
+ "uph arpoon",
2125
+ "h slash",
2126
+ "l lap",
2127
+ "ĠâĢ IJ",
2128
+ "upharpoon right",
2129
+ "de set",
2130
+ "si deset",
2131
+ "allow break",
2132
+ "l times",
2133
+ "la be",
2134
+ "big m",
2135
+ "raise box",
2136
+ "labe l",
2137
+ "array st",
2138
+ "re tch",
2139
+ "arrayst retch",
2140
+ "big star",
2141
+ "se p",
2142
+ "sma sh",
2143
+ "diamond suit",
2144
+ "s ha",
2145
+ "sha pe",
2146
+ "col sep",
2147
+ "n s",
2148
+ "tharpoo ns",
2149
+ "Ġ\\ -",
2150
+ "ne w",
2151
+ "c e",
2152
+ "normal size",
2153
+ "array colsep",
2154
+ "th in",
2155
+ "right lef",
2156
+ "rightlef tharpoons",
2157
+ "S S",
2158
+ "h e",
2159
+ "da st",
2160
+ "set length",
2161
+ "circle dast",
2162
+ "ce nt",
2163
+ "u re",
2164
+ "pi ct",
2165
+ "pict ure",
2166
+ "thin space",
2167
+ "d dag",
2168
+ "h phantom",
2169
+ "m ma",
2170
+ "r d",
2171
+ "su rd",
2172
+ "co mma",
2173
+ "comma nd",
2174
+ "up lus",
2175
+ "new command",
2176
+ "re newcommand",
2177
+ "Ve c",
2178
+ "li mit",
2179
+ "Ġ-- -",
2180
+ "V dash",
2181
+ "e th",
2182
+ "long leftarrow",
2183
+ "c u",
2184
+ "leq q",
2185
+ "cu r",
2186
+ "o slash",
2187
+ "su re",
2188
+ "math c",
2189
+ "big circ",
2190
+ "en sure",
2191
+ "ensure math",
2192
+ "x leftarrow",
2193
+ "var lim",
2194
+ "int e",
2195
+ "triangle down",
2196
+ "r ner",
2197
+ "s ho",
2198
+ "math or",
2199
+ "co rner",
2200
+ "sho rt",
2201
+ "mathor d",
2202
+ "Ġc m",
2203
+ "rt ex",
2204
+ "inte rtex",
2205
+ "intertex t",
2206
+ "hf il",
2207
+ "a n",
2208
+ "big skip",
2209
+ "w r",
2210
+ "var triangle",
2211
+ "w arrow",
2212
+ "up downarrow",
2213
+ "a e",
2214
+ "math rel",
2215
+ "end aligned"
2216
+ ]
2217
+ }
2218
+ }
tokenizer_config.json ADDED
@@ -0,0 +1,55 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "added_tokens_decoder": {
3
+ "0": {
4
+ "content": "<pad>",
5
+ "lstrip": false,
6
+ "normalized": false,
7
+ "rstrip": false,
8
+ "single_word": false,
9
+ "special": true
10
+ },
11
+ "1": {
12
+ "content": "<s>",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false,
17
+ "special": true
18
+ },
19
+ "2": {
20
+ "content": "</s>",
21
+ "lstrip": false,
22
+ "normalized": false,
23
+ "rstrip": false,
24
+ "single_word": false,
25
+ "special": true
26
+ },
27
+ "3": {
28
+ "content": "<unk>",
29
+ "lstrip": false,
30
+ "normalized": false,
31
+ "rstrip": false,
32
+ "single_word": false,
33
+ "special": true
34
+ },
35
+ "4": {
36
+ "content": "<mask>",
37
+ "lstrip": true,
38
+ "normalized": false,
39
+ "rstrip": false,
40
+ "single_word": false,
41
+ "special": true
42
+ }
43
+ },
44
+ "bos_token": "<s>",
45
+ "clean_up_tokenization_spaces": true,
46
+ "cls_token": "<s>",
47
+ "eos_token": "</s>",
48
+ "mask_token": "<mask>",
49
+ "model_max_length": 512,
50
+ "pad_token": "<pad>",
51
+ "processor_class": "TrOCRProcessor",
52
+ "sep_token": "</s>",
53
+ "tokenizer_class": "XLMRobertaTokenizer",
54
+ "unk_token": "<unk>"
55
+ }