Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation

Last update: Dec 06, 2022

Related tags

Overview

This is the official implementation of "Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation".

For more details, please refer to our paper.

Citing the paper

Please cite the paper in your publications if it helps your research:

@inproceedings{lyu2018multi,
      title={Multi-oriented scene text detection via corner localization and region segmentation},
      author={Lyu, Pengyuan and Yao, Cong and Wu, Wenhao and Yan, Shuicheng and Bai, Xiang},
      booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
      pages={7553--7563},
      year={2018}
}

Requirements
Installation
Models
Test
Train
License

Requirements

NVIDIA GPU, Ubuntu 14.04, Python2.7, CUDA8/9
PyTorch 0.2.0_3

Installation

git clone https://github.com/lvpengyuan/corner.git
sh ./make.sh   or  cd rpsroi_pooling && python build.py

Models

Download the model and place it in weights/

Our trained model: Google Drive;

Test

You can test a model in a single scale:

python eval_all.py

or in multi-scale:

python eval_multiscale.py

Note that, you should modify the model path and the test dataset before testing.

Train

python train.py

To train a new model, you should modify the training settings before training.

License

This code is only for academic purpose.

Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation

Related tags

Overview

Citing the paper

Contents

Requirements

Installation

Models

Test

Train

License

Owner

Pengyuan Lyu

Hand Detection and Finger Detection on Live Feed

Let's explore how we can extract text from forms

learn how to use Gesture Control to change the volume of a computer

Packaged, Pytorch-based, easy to use, cross-platform version of the CRAFT text detector

Repository of conference publications and source code for first-/ second-authored papers published at NeurIPS, ICML, and ICLR.

This repository summarized computer vision theories.

OCR system for Arabic language that converts images of typed text to machine-encoded text.

Maze generator and solver with python

Image Smoothing and Blurring Using OpenCV

A Vietnamese personal card OCR website built with Django.

Implementation of EAST scene text detector in Keras

Detect handwritten words in a text-line (classic image processing method).

A curated list of resources dedicated to scene text localization and recognition

This is a passport scanning web service to help you scan, identify and validate your passport created with a simple and flexible design and ready to be integrated right into your system!

Code for the ACL2021 paper "Combining Static Word Embedding and Contextual Representations for Bilingual Lexicon Induction"

nofacedb/faceprocessor is a face recognition engine for NoFaceDB program complex.

Detect textlines in document images

TedEval: A Fair Evaluation Metric for Scene Text Detectors

Code for CVPR 2022 paper "Bailando: 3D dance generation via Actor-Critic GPT with Choreographic Memory"

Detect the mathematical formula from the given picture and the same formula is extracted and converted into the latex code