Detect handwritten words in a text-line (classic image processing method).

Last update: Jan 03, 2023

Overview

Word segmentation

Implementation of scale space technique for word segmentation as proposed by R. Manmatha and N. Srimal. Even though the paper is from 1999, the method still achieves good results, is fast, and is easy to implement. The algorithm takes an image of a line as input and outputs the segmented words.

Run demo

Go to the src/ directory and run the script python main.py. The images from the data/ directory (taken from IAM dataset) are segmented into words and the results are saved to the out/ directory.

Documentation

An anisotropic filter kernel is applied to the input image to create blobs corresponding to words. After thresholding the blob-image, connected components are extracted which correspond to words.

Parameters

Most of the parameters of the function wordSegmentation deal with the shape of the filter kernel:

img: grayscale uint8 image of the text-line to be segmented.
kernelSize: size of filter kernel, must be an odd integer.
sigma: standard deviation of Gaussian function used for filter kernel.
theta: approximated width/height ratio of words, filter function is distorted by this factor.
minArea: ignore word candidates smaller than specified area.

The function prepareImg can be used to convert the input image to grayscale and to resize it to a fixed height:

img: input image.
height: image will be resized to fit specified height.

Algorithm

The illustration below shows how the algorithm works:

top left: input image.
top right: filter kernel is applied.
bottom left: blob image after thresholding.
bottom right: bounding boxes around words in original image.

Results

This algorithm gives good results on datasets with large inter-word-distances and small intra-word-distances like IAM. However, for historical datasets like Bentham or Ratsprotokolle results are not very good and more complex approaches should be used instead (e.g., a neural network based approach as implemented in the WordDetectorNN repository).

Detect handwritten words in a text-line (classic image processing method).

Related tags

Overview

Word segmentation

Run demo

Documentation

Parameters

Algorithm

Results

Owner

Harald Scheidl

A simple Digits Recogniser made in Python

Captcha Recognition

The code for “Oriented RepPoints for Aerail Object Detection”

This project modify tensorflow object detection api code to predict oriented bounding boxes. It can be used for scene text detection.

The project is an official implementation of our paper "3D Human Pose Estimation with Spatial and Temporal Transformers".

Bu uygulamada Python ve Opencv kullanarak bilgisayar kamerasından yüz tespiti yapıyoruz.

1st place solution for SIIM-FISABIO-RSNA COVID-19 Detection Challenge

Detecting Text in Natural Image with Connectionist Text Proposal Network (ECCV'16)

A webcam-based 3x3x3 rubik's cube solver written in Python 3 and OpenCV.

Rest API Written In Python To Classify NSFW Images.

The code of "Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes"

A tool for extracting text from scanned documents (via OCR), with user-defined post-processing.

Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and text alignment capabilities.

End-to-end pipeline for real-time scene text detection and recognition.

A Python script to capture images from multiple webcams at once and save them into your local machine

Introduction to Augmented Reality (AR) with Python 3 and OpenCV 4.2.

Code for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model"

A small C++ implementation of LSTM networks, focused on OCR.

[python3.6] 运用tf实现自然场景文字检测,keras/pytorch实现ctpn+crnn+ctc实现不定长场景文字OCR识别

Motion detector, Full body detection, Upper body detection, Cat face detection, Smile detection, Face detection (haar cascade), Silverware detection, Face detection (lbp), and Sending email notifications