File size: 4,414 Bytes
a11fed1
 
 
 
 
 
 
186f27c
5199d25
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
a11fed1
186f27c
a11fed1
 
 
 
 
 
 
 
 
 
 
 
 
 
7815858
a11fed1
 
506ea9c
 
ec90221
 
 
a11fed1
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
186f27c
a11fed1
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
186f27c
a11fed1
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
---
tags:
- web form detection
- form detection
- ui form detection
- form detect
- ui form detect
- web form ui detection
- Web form recognition
- Form field detection
- Form element detection
- Form field extraction
- Form structure analysis
- User interface form detection
- Web form parser
- Data extraction from forms
- Form input identification
- Form layout analysis
- Form boundary detection
- UI component recognition
- Form field segmentation
- Form element extraction
- Form data extraction
- Form field recognition
- Form text extraction
- Form structure detection
- Form field localization
- Web form layout analysis
model-index:
- name: foduucom/web-form-ui-field-detection
  results:
  - task:
      type: object-detection
    metrics:
    - type: precision
      value: 0.52196
      name: [email protected](box)
language:
- en
metrics:
- accuracy
pipeline_tag: object-detection
---
<div align="center">
  <img width="640" alt="foduucom/web-form-ui-field-detection" src="https://huggingface.co/foduucom/web-form-ui-field-detection/resolve/main/web-form-ui-field-detection%20banner.jpeg">
</div>


# Model architecture
<div align="center">
  <img width="640" alt="foduucom/web-form-ui-field-detection" src="https://huggingface.co/foduucom/web-form-ui-field-detection/resolve/main/ui%20architecture.jpeg">
</div>


# Model Overview
The web-form-Detect model is a yolov8 object detection model trained to detect and locate ui form fields in images. It is built upon the ultralytics library and fine-tuned using a dataset of annotated ui form images.

## Intended Use
The model is intended to be used for detecting details like Name,number,email,password,button,redio bullet and so on fields in images. It can be incorporated into applications that require automated detection ui form fields from images.

## Performance
The model has been evaluated on a held-out test dataset and achieved the following performance metrics:

Average Precision (AP): 0.51
Precision: 0.80
Recall: 0.70
F1 Score: 0.71
Please note that the actual performance may vary based on the input data distribution and quality.


## How to Get Started with the Model
To get started with the YOLOv8s object Detection model use for web ui detection, follow these steps:


```bash
pip install ultralyticsplus==0.0.28 ultralytics==8.0.43
```

- Load model and perform prediction:

```python

from ultralyticsplus import YOLO, render_result

# load model
model = YOLO('foduucom/web-form-ui-field-detection')

# set model parameters
model.overrides['conf'] = 0.25  # NMS confidence threshold
model.overrides['iou'] = 0.45  # NMS IoU threshold
model.overrides['agnostic_nms'] = False  # NMS class-agnostic
model.overrides['max_det'] = 1000  # maximum number of detections per image

# set image
image = '/path/to/your/document/images'

# perform inference
results = model.predict(image)

# observe results
print(results[0].boxes)
render = render_result(model=model, image=image, result=results[0])
render.show()
```

## Training Data
The model was trained on a diverse dataset containing images of web ui form data from different sources, resolutions, and lighting conditions. The dataset was annotated with bounding box coordinates to indicate the location of the ui form fields within the image.

Total Number of Images: 600
Annotation Format: Bounding box coordinates (xmin, ymin, xmax, ymax)

## Fine-tuning Process
 - Pretrained Model: TheError: Errors in your YAML metadata model was initialized with a pretrained object detection backbone (e.g. YOLO).
 - Loss Function: Mean Average Precision (mAP) loss was used for optimization during training.
 - Optimizer: Adam optimizer with a learning rate of 1e-4.
 - Batch Size:-1
 - Training Time: 1 hours on a single NVIDIA GeForce RTX 3090 GPU.
## Model Limitations
The model's performance is subject to variations in image quality, lighting conditions, and image resolutions.
The model may struggle with detecting web ui form in cases of extreme occlusion.
The model may not generalize well to non-standard ui form formats or variations.

#### Software

The model was trained and fine-tuned using a Jupyter Notebook environment.

## Model Card Contact

For inquiries and contributions, please contact us at [email protected].

```bibtex
@ModelCard{
  author    = {Nehul Agrawal and
               Rahul parihar},
  title     = {YOLOv8s web-form ui fields detection},
  year      = {2023}
}
```

---