RRD: Rotation-Sensitive Regression for Oriented Scene Text Detection

Last update: Jun 29, 2022

Overview

For more details, please refer to our paper.

Citing

Please cite the related work in your publications if it helps your research:

@inproceedings{liao2018rotation,
  title={Rotation-Sensitive Regression for Oriented Scene Text Detection},
  author={Liao, Minghui and Zhu, Zhen and Shi, Baoguang and Xia, Gui-song and Bai, Xiang},
  booktitle={Proc. CVPR},
  pages={5909--5918},
  year={2018}
}

Models

Model trained on ICDAR 2015 Incidental Text:
BaiduYun
Google Drive

Training of other models is in progress.

Demo

Download the ICDAR 2015 model and place it in "./models/ic15/", then run:

python examples/text/demo.py

The detection and recognition results are saved in "./visu_demo/".
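
The demo steps above can be wrapped in a small pre-flight check. The following is only a sketch and not part of the repository: the helper names `missing_prerequisites` and `run_demo` are hypothetical, the three paths are taken from this README, and the names of the model files inside "./models/ic15/" are not known here, so the check only tests that the directory is non-empty. It also assumes that the `python` on your PATH is the interpreter the demo expects.

```python
import subprocess
from pathlib import Path

# Paths as given in this README, relative to the repository root.
MODEL_DIR = Path("models/ic15")
DEMO_SCRIPT = Path("examples/text/demo.py")
OUTPUT_DIR = Path("visu_demo")


def missing_prerequisites(repo_root):
    """Return a list of messages describing what is missing (hypothetical helper)."""
    problems = []
    model_dir = repo_root / MODEL_DIR
    # The model file names are not documented here, so only check for "something".
    if not model_dir.is_dir() or not any(model_dir.iterdir()):
        problems.append("no model files found in {}".format(model_dir))
    if not (repo_root / DEMO_SCRIPT).is_file():
        problems.append("demo script not found at {}".format(repo_root / DEMO_SCRIPT))
    return problems


def run_demo(repo_root):
    """Run the demo command from this README and list the files it produced."""
    problems = missing_prerequisites(repo_root)
    if problems:
        raise FileNotFoundError("; ".join(problems))
    # Same command as in the README, executed from the repository root.
    subprocess.run(["python", str(DEMO_SCRIPT)], cwd=str(repo_root), check=True)
    out_dir = repo_root / OUTPUT_DIR
    return sorted(out_dir.iterdir()) if out_dir.is_dir() else []
```

Calling `run_demo(Path("."))` from the repository root would then either raise with a message naming the missing piece or return the files found in "./visu_demo/".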

Training

Coming soon

Owner

Minghui Liao, a Ph.D. student at Huazhong University of Science and Technology.

GitHub Repository https://github.com/MhLiao/RRD

