TextBoxes++-TensorFlow

TextBoxes++ re-implementation using tensorflow. This project is greatly inspired by slim project And many functions are modified based on SSD-tensorflow project

Author: Zhisheng Zou [email protected]

pretrained model

Google drive

environment

python2.7/python3.5

tensorflow-gpu 1.8.0

at least one gpu

how to use

Getting the xml file like this example xml and put the image together because we need the format like this standard xml
1. picture format: *.png or *.PNG
Getting the xml and flags ensure the XML file is under the same directory as the corresponding image.execute the code: convert_xml_format.py
1. python tools/convert_xml_format.py -i in_dir -s split_flag -l save_logs -o output_dir
2. in_dir means the absolute directory which contains the pic and xml
3. split_flag means whether or not to split the datasets
4. save_logs means whether to save train_xml.txt
5. output_dir means where to save xmls
Getting the tfrecords
1. python gene_tfrecords.py --xml_img_txt_path=./logs/train_xml.txt --output_dir=tfrecords
2. xml_img_txt_path like this train xml
3. output_dir means where to save tfrecords
Training
1. python train.py --train_dir =some_path --dataset_dir=some_path --checkpoint_path=some_path
2. train_dir store the checkpoints when training
3. dataset_dir store the tfrecords for training
4. checkpoint_path store the model which needs to be fine tuned
Testing
1. python test.py -m /home/model.ckpt-858 -o test
2. -m which means the model
3. -o which means output_result_dir
4. -i which means the test img dir
5. -c which means use which device to run the test
6. -n which means the nms threshold
7. -s which means the score threshold

Note:

when you are training the model, you can run the eval_result.py to eval your model and save the result

Textboxes_plusplus implementation with Tensorflow (python)

Related tags

Overview

TextBoxes++-TensorFlow

pretrained model

environment

how to use

Note:

Owner

Crop regions in napari manually

Usando o Amazon Textract como OCR para Extração de Dados no DynamoDB

Motion detector, Full body detection, Upper body detection, Cat face detection, Smile detection, Face detection (haar cascade), Silverware detection, Face detection (lbp), and Sending email notifications

PianoVisuals - Create background videos synced with piano music using opencv

aardio的opencv库

OpenCVを用いたカメラキャリブレーションのサンプルです。2021/06/21時点でPython実装のある3種類(通常カメラ向け、魚眼レンズ向け(fisheyeモジュール)、全方位カメラ向け(omnidirモジュール))について用意しています。

Automatically remove the mosaics in images and videos, or add mosaics to them.

GDB python tool to pretty print and debug c++ xtensor containers

Brief idea about our project is mentioned in project presentation file.

DouZero is a reinforcement learning framework for DouDizhu - 斗地主AI

Tensorflow-based CNN+LSTM trained with CTC-loss for OCR

YOLOv5 in DOTA with CSL_label.(Oriented Object Detection)（Rotation Detection）（Rotated BBox）

keras复现场景文本检测网络CPTN: 《Detecting Text in Natural Image with Connectionist Text Proposal Network》；欢迎试用，关注，并反馈问题...

Create single line SVG illustrations from your pictures

Introduction to Augmented Reality (AR) with Python 3 and OpenCV 4.2.

CUTIE (TensorFlow implementation of Convolutional Universal Text Information Extractor)

A selectional auto-encoder approach for document image binarization

Face Detection with DLIB

ISI's Optical Character Recognition (OCR) software for machine-print and handwriting data