PyTorch implementation of PSEnet with a Pyramid Attention Network as the feature extractor

Last update: Oct 10, 2022

Overview

Scene Text-Spotting based on PSEnet+CRNN

PyTorch implementation of an end-to-end text spotter that pairs a PSEnet text detector with a CRNN text recognizer. We plan to grow this repository into an open research platform for multilingual text detection and recognition in natural scene images, targeted at low-resource languages.
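
The detector-then-recognizer flow described above can be sketched roughly as follows. This is an illustrative outline only, not the repository's code: `spot_text`, `crop`, and the `detect` and `recognize` callables are hypothetical names, boxes are simplified to axis-aligned rectangles, and the image is a plain nested list so the sketch runs without any dependencies.

```python
from typing import Callable, List, Sequence, Tuple

# Hypothetical types for illustration; the real detector may return polygons.
Box = Tuple[int, int, int, int]   # (x0, y0, x1, y1), axis-aligned for simplicity
Image = Sequence[Sequence[int]]   # rows of pixel values


def crop(image: Image, box: Box) -> List[List[int]]:
    """Cut the region covered by `box` out of `image`."""
    x0, y0, x1, y1 = box
    return [list(row[x0:x1]) for row in image[y0:y1]]


def spot_text(
    image: Image,
    detect: Callable[[Image], List[Box]],
    recognize: Callable[[List[List[int]]], str],
) -> List[Tuple[Box, str]]:
    """Run the detector, then run the recognizer on each detected region."""
    results = []
    for box in detect(image):
        results.append((box, recognize(crop(image, box))))
    return results


if __name__ == "__main__":
    # Stand-in models: a fixed box and a recognizer that reports the crop size.
    dummy_image = [[0] * 8 for _ in range(4)]
    dummy_detect = lambda img: [(1, 0, 5, 2)]
    dummy_recognize = lambda region: f"{len(region)}x{len(region[0])}"
    print(spot_text(dummy_image, dummy_detect, dummy_recognize))
```

In this sketch the two stages only share cropped regions, so either model could be swapped out independently.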

Requirements

Python 3.6.5
PyTorch 1.2
pyclipper
Polygon 3.0.8
OpenCV 3.4.1
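
A quick way to compare an environment against the list above is a small version check. The script below is a sketch and not part of the repository: the import names (`torch`, `cv2`, `Polygon`, `pyclipper`) are the usual ones for these packages, and it assumes each module exposes `__version__`, reporting "version unknown" when one does not.

```python
import importlib
import sys

# Versions taken from the Requirements list; keys are the assumed import names.
REQUIRED = {
    "torch": "1.2",
    "cv2": "3.4.1",
    "Polygon": "3.0.8",
    "pyclipper": None,  # no version is pinned in the README
}


def installed_version(module_name):
    """Return the module's __version__, 'unknown' if it has none, or None if it cannot be imported."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        return None
    return str(getattr(module, "__version__", "unknown"))


def check_requirements(required=REQUIRED):
    """Map each module name to a (wanted, found, status) tuple."""
    report = {}
    for name, wanted in required.items():
        found = installed_version(name)
        if found is None:
            status = "missing"
        elif wanted is None:
            status = "ok"
        elif found == "unknown":
            status = "version unknown"
        elif found.startswith(wanted):
            status = "ok"
        else:
            status = "version differs"
        report[name] = (wanted, found, status)
    return report


if __name__ == "__main__":
    print("Python", sys.version.split()[0], "(README lists 3.6.5)")
    for name, (wanted, found, status) in check_requirements().items():
        print(f"{name:10s} wanted={wanted} found={found} status={status}")
```

A "version differs" result is only a hint; newer releases may still work, but that has not been verified here.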

Demo

Download the trained CRNN and PSEnet models from the links provided below.
Copy the paths of the downloaded models into params.py.
Run end-end.py:

python end-end.py --img [path to image] --e2e_config_name [end to end config name]
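
To run the demo on several images, the command above can be wrapped in a short script. This is a sketch under assumptions: it relies only on the `--img` and `--e2e_config_name` flags shown above, the helper names `build_command` and `run_on_folder` are hypothetical, the image extensions are a guess, and the config name must be replaced with one that exists in your setup.

```python
import subprocess
import sys
from pathlib import Path

# Assumption: these are the image types you want to process.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}


def build_command(image_path, config_name, script="end-end.py"):
    """Build the demo command for one image, using the flags from the README."""
    return [
        sys.executable, script,
        "--img", str(image_path),
        "--e2e_config_name", config_name,
    ]


def run_on_folder(folder, config_name, script="end-end.py"):
    """Run the demo once per image in `folder`; stops on the first failing run."""
    images = sorted(
        p for p in Path(folder).iterdir()
        if p.is_file() and p.suffix.lower() in IMAGE_SUFFIXES
    )
    for image in images:
        subprocess.run(build_command(image, config_name, script), check=True)
    return images


if __name__ == "__main__":
    # Usage: python run_demo_folder.py <image folder> <end to end config name>
    run_on_folder(sys.argv[1], sys.argv[2])
```

Each image starts a new Python process, so the models are reloaded every time; that keeps the sketch simple but is slow for large folders.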

Pre-trained Models

Both PSEnet and CRNN pre-trained models can be found here: gdrive

The PSEnet model is a multilingual text detector trained on MLT 2019. It works quite well.
The CRNN model recognizes Hindi, Bangla, Malayalam, Kannada, Tamil, Telugu, Odia, Sanskrit, and Marathi.

Download the models into the models/ directory and update params.py if required.
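
After downloading, it can help to print the absolute paths of the checkpoint files so they can be pasted into params.py. The helper below is a sketch: `list_model_files` is a hypothetical name, and the `.pth` and `.pt` extensions are an assumption about how the downloaded files are named, so adjust them to match what you actually receive.

```python
from pathlib import Path

# Assumption: checkpoints use common PyTorch extensions.
CHECKPOINT_SUFFIXES = (".pth", ".pt")


def list_model_files(models_dir="models", suffixes=CHECKPOINT_SUFFIXES):
    """Return sorted absolute paths of checkpoint files directly inside `models_dir`."""
    directory = Path(models_dir)
    if not directory.is_dir():
        return []
    return sorted(
        str(p.resolve())
        for p in directory.iterdir()
        if p.is_file() and p.suffix.lower() in suffixes
    )


if __name__ == "__main__":
    for path in list_model_files():
        print(path)
```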

Training instructions

To train your own detection model, refer to this file.
To train your own recognition model, refer to this file.

Samples

Contributors

Azhar Shaikh, PES University LinkedIn
Nishant Sinha, OffNote Labs

This work was done as part of an internship with OffNote Labs.

References

If this repository helps you, please star it. Thank you!
