Document Classification with LayoutLM

This repository contains code for a document classification project using the LayoutLM model. The goal of this project is to accurately classify various types of documents, such as birth certificates, driving licenses, social security numbers, and tax documents, using layout-aware deep learning techniques.

Introduction
Features
Getting Started
- Prerequisites
- Installation
Usage
Data Preprocessing
Training
Evaluation
Model Inference
Contributing
License

Introduction

Document classification is a crucial task in various domains, including legal, finance, and healthcare. This project leverages the LayoutLM model, which is designed to understand the content and structure of documents by considering both text and bounding box information. With this model, we achieved an impressive accuracy of 89% on our test dataset.

Features

Document classification using LayoutLM.
Data preprocessing scripts for handling text and bounding box information.
Training pipeline for fine-tuning the LayoutLM model.
Evaluation scripts to measure model performance.
Model inference code for classifying new documents.

Getting Started

Prerequisites

Before running the code, make sure you have the following prerequisites installed:

Python 3.x
PyTorch
Transformers library by Hugging Face
Datasets library by Hugging Face

Installation

Clone this repository to your local machine:

git clone https://github.com/atulpokharel-gp/Document-Classification-using-LayoutLM
cd Document-Classification-using-LayoutLM