Form extraction
Extracting data from paper forms differs somewhat from simple digitization. The procedure consists of the following steps:
- Preparing forms for scanning: sorting documents by type, removing staples, cleaning, etc.
- Scanning on a high-quality scanner (accurate data extraction needs a high-quality input image to keep the software from making reading mistakes)
- Determining data extraction areas (structured vs. unstructured forms)
- Applying OCR/ICR/BCR/OMR technologies for optical recognition of machine-printed or handwritten text and for reading barcodes and check marks
- Automatic validation of recognized data: extracted data is compared with general or user-made dictionaries
- Validation at the SQL database level, e.g. marking duplicate documents
- If errors are found, manual validation follows, in which the correctness of the extracted data is checked against database records and the original paper documents
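
The two automatic validation steps above can be illustrated with a minimal sketch. It is only an illustration under stated assumptions, not the software actually used: the city dictionary, the `forms` table, the function names and the 0.8 similarity cutoff are all hypothetical, and Python's built-in `sqlite3` module stands in for whatever SQL database is deployed.

```python
import difflib
import sqlite3

# Hypothetical dictionary of allowed values for one form field.
CITY_DICTIONARY = ["Prague", "Brno", "Ostrava"]


def validate_against_dictionary(value, dictionary, cutoff=0.8):
    """Return (value_to_store, needs_manual_review)."""
    if value in dictionary:
        return value, False
    # Accept a close dictionary entry, e.g. a single misread character.
    matches = difflib.get_close_matches(value, dictionary, n=1, cutoff=cutoff)
    if matches:
        return matches[0], False
    # Nothing similar enough: hand the field over to manual validation.
    return value, True


def store_and_mark_duplicate(conn, document_id, city):
    """Insert a record; return True if document_id is already stored."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS forms "
        "(document_id TEXT PRIMARY KEY, city TEXT)"
    )
    try:
        with conn:  # commits on success, rolls back on error
            conn.execute(
                "INSERT INTO forms VALUES (?, ?)", (document_id, city)
            )
        return False
    except sqlite3.IntegrityError:
        # The primary key already exists, so the document is a duplicate.
        return True
```

In this sketch a recognized value that is close to a dictionary entry is corrected automatically, an unrecognizable value is flagged for manual validation, and a document whose identifier is already in the database is reported as a duplicate instead of being stored twice.
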
We extract data from all common types of paper forms. For more information, please do not hesitate to contact us.

