A three-stage detection and recognition pipeline for complex meters in the wild

This is the first released system for detecting and reading complex meters in the wild. The system is divided into three modules. First, a YOLO-based detector crops the meter region from the scene image. Second, a spatial transformer module rectifies the position of the meter. Finally, an end-to-end network reads the meter value through pointer/dial prediction and key number learning.
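To make the last stage more concrete, here is a minimal sketch of how a reading could be derived once a pointer direction and some key numbers are known. It assumes, purely for illustration, that the pointer and each recognized scale number are expressed as angles around the dial centre and that the scale is linear between adjacent numbers. The function name read_meter and the (angle, value) input format are hypothetical and are not taken from this repository, whose network may combine these cues differently.

```python
from bisect import bisect_right


def read_meter(pointer_angle, key_numbers):
    """Interpolate a meter reading from a pointer angle (illustrative only).

    key_numbers is a list of (angle_in_degrees, value) pairs, one per
    recognized scale number, using the same angular convention as
    pointer_angle. This input format is an assumption made for the sketch.
    """
    marks = sorted(key_numbers)
    if len(marks) < 2:
        raise ValueError("need at least two key numbers to interpolate")

    angles = [angle for angle, _ in marks]
    if len(set(angles)) != len(angles):
        raise ValueError("key numbers must have distinct angles")

    # Pick the pair of adjacent marks that brackets the pointer. If the
    # pointer lies outside the marked range, fall back to the nearest end
    # segment and extrapolate linearly from it.
    i = bisect_right(angles, pointer_angle)
    i = min(max(i, 1), len(marks) - 1)
    (a0, v0), (a1, v1) = marks[i - 1], marks[i]

    # Linear interpolation between the two bracketing key numbers.
    return v0 + (pointer_angle - a0) * (v1 - v0) / (a1 - a0)
```

For example, with key numbers 0 at 0 degrees and 10 at 90 degrees, a pointer at 45 degrees is read as 5.0 under these assumptions.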

Visualization results

From left to right: the original image, the meter rectification process, and the meter value reading result.

ToDo List

Installation

Requirements:

Python3 (Python3.7 is recommended)
PyTorch >= 1.0
torchvision from master
numpy
skimage
OpenCV==3.0.x
CUDA >= 9.0 (10.0 is recommended)
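Before running anything, it can help to confirm that the Python packages listed above are importable. The following is a small sketch using only the standard library. The helper name check_environment is hypothetical, and it checks importability only, not the version constraints; cv2 and skimage are the import names of OpenCV and scikit-image.

```python
import importlib.util

# Import names of the packages listed in the requirements.
REQUIRED_MODULES = ["torch", "torchvision", "numpy", "skimage", "cv2"]


def check_environment(modules=REQUIRED_MODULES):
    """Return a {module_name: is_importable} mapping without importing anything."""
    return {name: importlib.util.find_spec(name) is not None for name in modules}


if __name__ == "__main__":
    for name, available in check_environment().items():
        print(f"{name}: {'ok' if available else 'MISSING'}")
```

Version requirements (for example PyTorch >= 1.0 or the OpenCV 3.0.x pin) still need to be checked separately, for instance with pip list.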

Models

Download the trained model

Put distro_net.pt into meter_distro/weight and textgraph_vgg_450.pth into model/meter_data.
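A quick way to verify that the two weight files named above ended up in the expected folders is sketched below. The helper missing_weights is hypothetical and not part of the repository; it assumes the paths are relative to the repository root.

```python
from pathlib import Path

# Weight locations as stated in this README, relative to the repository root.
EXPECTED_WEIGHTS = [
    Path("meter_distro/weight/distro_net.pt"),
    Path("model/meter_data/textgraph_vgg_450.pth"),
]


def missing_weights(repo_root="."):
    """Return the expected weight files that are not present under repo_root."""
    root = Path(repo_root)
    return [p.as_posix() for p in EXPECTED_WEIGHTS if not (root / p).is_file()]
```

Calling missing_weights() from the repository root returns an empty list once both files are in place.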

Demo

You can run the demo for single-image inference in two steps.

Run python get_meter_area.py; the detected meter will be stored in scene_image_data/deteced_meter.

Run python predict.py to get the rectified meter and the final reading.
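The two demo steps can also be chained from a small wrapper, sketched here under the assumption that both scripts are run from the repository root and take no command-line arguments, as in the commands above. The names demo_commands and run_demo are hypothetical.

```python
import subprocess
import sys

# Demo scripts in the order given in this README.
DEMO_STEPS = ["get_meter_area.py", "predict.py"]


def demo_commands(python=sys.executable):
    """Build one command per demo step, using the given Python interpreter."""
    return [[python, script] for script in DEMO_STEPS]


def run_demo(repo_root="."):
    """Run detection, then rectification and reading; stop at the first failure."""
    for cmd in demo_commands():
        # check=True raises CalledProcessError if a step exits with an error,
        # so predict.py is not run when detection fails.
        subprocess.run(cmd, cwd=repo_root, check=True)
```

Calling run_demo() from the repository root runs both steps in sequence.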

Owner

Yan Shu
