Machine Learning to Denoise Images for Better OCR Accuracy

This project is an adaptation of this tutorial and used only for learning purposes: https://www.pyimagesearch.com/2021/10/20/using-machine-learning-to-denoise-images-for-better-ocr-accuracy/#download-the-code

Setting Up the project 🚀

First and foremost clone the project with:

$ git clone https://github.com/AntonioBriPerez/Ocr-Denoiser

You don't need to extract the zip files in order to train the model.

Once you have cloned the repository you will need to extract the features from the noisy images. This script will extract 5 x 5 - 25-d feature vectors and the it will extract the target (or cleaned) pixel value from the correspondiente ground truth standard image. And then, this features will be saved in a csv file (~200MB). To extract this features you will have to execute:

$ python3 build_features.py

It will generate the following output:

Once you have done that we will have to load those features in a proper split to train our Random Forest Regressor. That code is implemented in the file train_denoiser.py. To train the model you will have to run the command:

$ python train_denoiser.py

And it will generate:

To check that the model performs good you can execute:

$ python3 denoise_document.py --testing denoising-dirty-documents/test

And some images will be written in disk so you can check the original image and the image obtained by the model we just have trained.

Any doubts or suggestions please open an issue.

Machine Leaning applied to denoise images to improve OCR Accuracy

Related tags

Overview

Machine Learning to Denoise Images for Better OCR Accuracy

Setting Up the project 🚀

Owner

Antonio Bri Pérez

When Age-Invariant Face Recognition Meets Face Age Synthesis: A Multi-Task Learning Framework (CVPR 2021 oral)

Tool which allow you to detect and translate text.

(CVPR 2021) ST3D: Self-training for Unsupervised Domain Adaptation on 3D Object Detection

Augmenting Anchors by the Detector Itself

Face Anonymizer - FaceAnonApp v1.0

computer vision, image processing and machine learning on the web browser or node.

Fun program to overlay a mask to yourself using a webcam

Slice a single image into multiple pieces and create a dataset from them

Smart computer vision application

An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss".

Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.

Official code for :rocket: Unsupervised Change Detection of Extreme Events Using ML On-Board :rocket:

CUTIE (TensorFlow implementation of Convolutional Universal Text Information Extractor)

Regions sanitàries (RS), Sectors Sanitàris (SS) i Àrees Bàsiques de Salut (ABS) de Catalunya

chineseocr/table_line 表格线检测模型pytorch版

The virtual calculator will be above the live streaming from your camera

Code for CVPR 2022 paper "Bailando: 3D dance generation via Actor-Critic GPT with Choreographic Memory"

2 telegram-bots: for image recognition and for text generation

[ICCV, 2021] Cloud Transformers: A Universal Approach To Point Cloud Processing Tasks

✌️Using this you can control your PC/Laptop volume by Hand Gestures created with Python.