bart-base-grammar-synthesis
This model is a fine-tuned version of facebook/bart-base on an expanded version of the JFLEG dataset.
You can find other grammar-synthesis models by searching for the grammar synthesis tag
Basic Usage Example
Installation
First, make sure you have the transformers
package installed. You can install it using pip:
pip install -U transformers
Usage
from transformers import pipeline
# Initialize the text-generation pipeline for text correction
corrector = pipeline("text2text-generation", "pszemraj/bart-base-grammar-synthesis")
# Example text to correct
raw_text = "The toweris 324 met (1,063 ft) tall, about height as .An 81-storey building, and biggest longest structure paris. Is square, measuring 125 metres (410 ft) on each side. During its constructiothe eiffel tower surpassed the washington monument to become the tallest man-made structure in the world, a title it held for 41 yearsuntilthe chryslerbuilding in new york city was finished in 1930. It was the first structure to goat a height of 300 metres. Due 2 the addition ofa brdcasting aerial at the t0pp of the twr in 1957, it now taller than chrysler building 5.2 metres (17 ft). Exxxcluding transmitters, eiffel tower is 2ndd tallest ree-standing structure in france after millau viaduct."
# Correct the text using the text-generation pipeline
corrected_text = corrector(raw_text)[0]["generated_text"]
# Print the corrected text
print(corrected_text)
This example demonstrates how to use the text-generation pipeline to correct the grammar in a given text. The corrector
pipeline is initialized with the "pszemraj/bart-base-grammar-synthesis" model, which is designed for grammar correction. The corrector
pipeline takes the raw text as input and returns the corrected text. Make sure to install the required dependencies and models before running the code.
Intended uses & limitations
- robust grammar correction
- the model has a license of
cc-by-nc-sa-4.0
as it uses the JFLEG dataset + augments it for training
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 0.0001
- train_batch_size: 8
- eval_batch_size: 8
- seed: 42
- distributed_type: multi-GPU
- gradient_accumulation_steps: 16
- total_train_batch_size: 128
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: cosine
- lr_scheduler_warmup_ratio: 0.02
- num_epochs: 3.0
- Downloads last month
- 58