TensorFlow-based implementation of "Pyramid Scene Parsing Network".

Last update: Dec 20, 2022

Overview

PSPNet_tensorflow

Important

Code is fine for inference. However, the training code is just for reference and might be only used for fine-tuning. If you want to train from scratch, you need to implement the Synchronize BN layer first to do large batch-size training (as described in the paper). It seems that this repo has reproduced it, you can take a look on it.

Introduction

This is an implementation of PSPNet in TensorFlow for semantic segmentation on the cityscapes dataset. We first convert weight from Original Code by using caffe-tensorflow framework.

Update:

News (2018.11.08 updated):

Now you can try PSPNet on your own image online using ModelDepot live demo!

2018/01/24:

Support evaluation code for ade20k dataset

2018/01/19:

Support inference phase for ade20k dataset using model of pspnet50 (convert weights from original author)
Using tf.matmul to decode label, so as to improve the speed of inference.

2017/11/06:

Support different input size by padding input image to (720, 720) if original size is smaller than it, and get result by cropping image in the end.

2017/10/27:

Change bn layer from tf.nn.batch_normalization into tf.layers.batch_normalization in order to support training phase. Also update initial model in Google Drive.

Install

Get restore checkpoint from Google Drive and put into model directory. Note: Select the checkpoint corresponding to the dataset.

Inference

To get result on your own images, use the following command:

python inference.py --img-path=./input/test.png --dataset cityscapes

Inference time: ~0.6s

Options:

--dataset cityscapes or ade20k
--flipped-eval 
--checkpoints /PATH/TO/CHECKPOINT_DIR

Evaluation

Cityscapes

Perform in single-scaled model on the cityscapes validation datase.

Method	Accuracy
Without flip	76.99%
Flip	77.23%

ade20k

Method	Accuracy
Without flip	40.00%
Flip	40.67%

To re-produce evluation results, do following steps:

Download Cityscape dataset or ADE20k dataset first.
change data_dir to your dataset path in evaluate.py:

'data_dir': ' = /Path/to/dataset'

Run the following command:

python evaluate.py --dataset cityscapes

List of Args:

--dataset - ade20k or cityscapes
--flipped-eval  - Using flipped evaluation method
--measure-time  - Calculate inference time

Image Result

cityscapes

Input image	Output image

ade20k

Input image	Output image

real world

Input image	Output image

Citation

@article{zhao2017pspnet,
  author = {Hengshuang Zhao and
            Jianping Shi and
            Xiaojuan Qi and
            Xiaogang Wang and
            Jiaya Jia},
  title = {Pyramid Scene Parsing Network},
  booktitle = {Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  year = {2017}
}

Scene Parsing through ADE20K Dataset. B. Zhou, H. Zhao, X. Puig, S. Fidler, A. Barriuso and A. Torralba. Computer Vision and Pattern Recognition (CVPR), 2017. (http://people.csail.mit.edu/bzhou/publication/scene-parse-camera-ready.pdf)

@inproceedings{zhou2017scene,
    title={Scene Parsing through ADE20K Dataset},
    author={Zhou, Bolei and Zhao, Hang and Puig, Xavier and Fidler, Sanja and Barriuso, Adela and Torralba, Antonio},
    booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
    year={2017}
}

Semantic Understanding of Scenes through ADE20K Dataset. B. Zhou, H. Zhao, X. Puig, S. Fidler, A. Barriuso and A. Torralba. arXiv:1608.05442. (https://arxiv.org/pdf/1608.05442.pdf)

@article{zhou2016semantic,
  title={Semantic understanding of scenes through the ade20k dataset},
  author={Zhou, Bolei and Zhao, Hang and Puig, Xavier and Fidler, Sanja and Barriuso, Adela and Torralba, Antonio},
  journal={arXiv preprint arXiv:1608.05442},
  year={2016}
}

TensorFlow-based implementation of "Pyramid Scene Parsing Network".

Related tags

Overview

PSPNet_tensorflow

Important

Introduction

Update:

News (2018.11.08 updated):

2018/01/24:

2018/01/19:

2017/11/06:

2017/10/27:

Install

Inference

Evaluation

Cityscapes

ade20k

Image Result

cityscapes

ade20k

real world

Citation

Owner

HsuanKung Yang

RoFormer_pytorch

Inferring Lexicographically-Ordered Rewards from Preferences

The official implementation of "Rethink Dilated Convolution for Real-time Semantic Segmentation"

Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations. [2021]

A package for "Procedural Content Generation via Reinforcement Learning" OpenAI Gym interface.

[NeurIPS'21 Spotlight] PyTorch code for our paper "Aligned Structured Sparsity Learning for Efficient Image Super-Resolution"

This toolkit provides codes to download and pre-process the SLUE datasets, train the baseline models, and evaluate SLUE tasks.

Official repository for GCR rerank, a GCN-based reranking method for both image and video re-ID

An LSTM based GAN for Human motion synthesis

QHack—the quantum machine learning hackathon

Offline Reinforcement Learning with Implicit Q-Learning

AAAI 2022: Stationary diffusion state neural estimation

Pytorch implementation of Learning Rate Dropout.

Progressive Image Deraining Networks: A Better and Simpler Baseline

Pytorch implementation of the paper "Enhancing Content Preservation in Text Style Transfer Using Reverse Attention and Conditional Layer Normalization"

A simple AI that will give you si ple task and this is made with python

Real-Time SLAM for Monocular, Stereo and RGB-D Cameras, with Loop Detection and Relocalization Capabilities

Some experiments with tennis player aging curves using Hilbert space GPs in PyMC. Only experimental for now.

InsightFace: 2D and 3D Face Analysis Project on MXNet and PyTorch

This is a five-step framework for the development of intrusion detection systems (IDS) using machine learning (ML) considering model realization, and performance evaluation.