Textboxes : Image Text Detection Model : python package (tensorflow)

Last update: Dec 15, 2022

Overview

shinTB

Abstract

A python package for use Textboxes : Image Text Detection Model

implemented by tensorflow, cv2

Textboxes Paper Review in Korean (My Blog) : shinjayne.github.io/textboxes

shintb : useable textboxes python package (Source codes are in here)

svt1 : Street view Text dataset. can use with shintb.svt_data_loader.SVTDataLoader when training Textboxes model

config.py : (NECESSARY) configuration of model building and training with shinTB

main.py : simple example useage of shinTB package

Dependancies

python Version: 3.5.3
numpy Version: 1.13.0
tensorflow Version: 1.2.1
cv2

How to use

Clone this repository to your local.
You will use shintb python package and config.py for building and training your own Textboxes model.
svt1 gives us training / test data.
Open new python file.
Import config.config and shintb.

from config import config
from shintb import graph_drawer, default_box_control, svt_data_loader, runner

Initialize GraphDrawer,DefaultBoxControl,SVTDataLoader instance.

graphdrawer = graph_drawer.GraphDrawer(config)

dataloader = svt_data_loader.SVTDataLoader('./svt1/train.xml', './svt1/test.xml')

dbcontrol = default_box_control.DefaultBoxControl(config, graphdrawer)

GraphDrawer instance contains a tensorflow graph of Textboxes.
DefaultboxControl instance contains methods and attributes which is related to default box.
SVTDataLoader instance loads data from svt1.
Initialize Runner instance.

runner = runner.Runner(config, graphdrawer, dataloader, dbcontrol)

Runner uses GraphDrawer,DefaultBoxControl,SVTDataLoader instance.
If you want to train your Textboxes model, use Runner.train(). Every 1000 step, shintb will save ckpt file in the directory you set in config.py.

runner.train()

If you want to validate/test your model, use Runner.test()

runner.test()

After training, if you want to detect texts from one image use Runner.image().

runner.image(<your_image_directory>)

Textboxes : Image Text Detection Model : python package (tensorflow)

Related tags

Overview

shinTB

Abstract

Dependancies

How to use

Owner

Jayne Shin (신재인)

QuanTaichi: A Compiler for Quantized Simulations (SIGGRAPH 2021)

Toolbox for OCR post-correction

A small C++ implementation of LSTM networks, focused on OCR.

Let's explore how we can extract text from forms

A pure pytorch implemented ocr project including text detection and recognition

Face Recognizer using Opencv Python

Packaged, Pytorch-based, easy to use, cross-platform version of the CRAFT text detector

The open source extract transaction infomation by using OCR.

Fusion 360 Add-in that creates a pair of toothed curves that can be used to split a body and create two pieces that slide and lock together.

A facial recognition program that plays a alarm (mp3 file) when a person i seen in the room. A basic theif using Python and OpenCV

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Detect the mathematical formula from the given picture and the same formula is extracted and converted into the latex code

Image processing is one of the most common term in computer vision

Document blur detection based on Laplacian operator and text detection.

Detect and fix skew in images containing text

Distort a video using Seam Carving (video) and Vibrato effect (sound)

Deep Learning Chinese Word Segment

Comparison-of-OCR (KerasOCR, PyTesseract,EasyOCR)

Introduction to Augmented Reality (AR) with Python 3 and OpenCV 4.2.

This repo contains a script that allows us to find range of colors in images using openCV, and then convert them into geo vectors.