Optical Character Recognition made seamless & accessible to anyone, powered by PyTorch

Task: recognition

Example usage:

>>> from doctr.io import DocumentFile
>>> from doctr.models import ocr_predictor, from_hub

>>> img = DocumentFile.from_images(['<image_path>'])
>>> # Load your model from the hub
>>> model = from_hub('mindee/my-model')

>>> # Pass it to the predictor
>>> # If your model is a recognition model:
>>> predictor = ocr_predictor(det_arch='db_resnet50',
>>>                           reco_arch=model,
>>>                           pretrained=True)

>>> # Get your predictions
>>> res = predictor(img)

Training configuration and logs: https://wandb.ai/xbankov/text-recognition

Run Configuration

{ "hf_dataset_name": "fimu-docproc-research/born_digital", "name": "master_20221106-223158", "epochs": 50, "lr": 0.001, "weight_decay": 0, "batch_size": 512, "input_size": 32, "sched": "cosine", "sample": null, "workers": 16, "wb": true, "push_to_hub": "fimu-docproc-research/master_0.0.1", "test_only": false, "arch": "master" }

Downloads last month
7
Inference API
Unable to determine this model’s pipeline type. Check the docs .