Semantic segmentation task for ADE20k & cityscapse dataset, based on several models.

Last update: Oct 13, 2022

Overview

semantic-segmentation-tensorflow

This is a Tensorflow implementation of semantic segmentation models on MIT ADE20K scene parsing dataset and Cityscapes dataset We re-produce the inference phase of several models, including PSPNet, FCN, and ICNet by transforming the released pre-trained weights into tensorflow format, and apply on handcraft models. Also, we refer to ENet from freg856 github. Still working on task integrated.

Models

PSPNet
FCN
ENet
ICNet

...to be continue

Install

Get corresponding transformed pre-trained weights, and put into model directory:

FCN	PSPNet	ICNet
Google drive	Google drive	Google drive

Inference

Run following command:

python inference.py --img-path /Path/To/Image --dataset Model_Type

Arg list

--model - choose from "icnet"/"pspnet"/"fcn"/"enet"

Import module in your code:

from model import FCN8s, PSPNet50, ICNet, ENet

model = PSPNet50() # or another model

model.read_input(img_path)  # read image data from path

sess = tf.Session(config=config)
init = tf.global_variables_initializer()
sess.run(init)

model.load(model_path, sess)  # load pretrained model
preds = model.forward(sess) # Get prediction

Results

ade20k

Input Image	PSPNet	FCN

cityscapes

Input Image	ICNet	ENet

Citation

@inproceedings{zhao2017pspnet,
  author = {Hengshuang Zhao and
            Jianping Shi and
            Xiaojuan Qi and
            Xiaogang Wang and
            Jiaya Jia},
  title = {Pyramid Scene Parsing Network},
  booktitle = {Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  year = {2017}
}

Scene Parsing through ADE20K Dataset. B. Zhou, H. Zhao, X. Puig, S. Fidler, A. Barriuso and A. Torralba. Computer Vision and Pattern Recognition (CVPR), 2017. (http://people.csail.mit.edu/bzhou/publication/scene-parse-camera-ready.pdf)

@inproceedings{zhou2017scene,
    title={Scene Parsing through ADE20K Dataset},
    author={Zhou, Bolei and Zhao, Hang and Puig, Xavier and Fidler, Sanja and Barriuso, Adela and Torralba, Antonio},
    booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
    year={2017}
}

Semantic Understanding of Scenes through ADE20K Dataset. B. Zhou, H. Zhao, X. Puig, S. Fidler, A. Barriuso and A. Torralba. arXiv:1608.05442. (https://arxiv.org/pdf/1608.05442.pdf)

@article{zhou2016semantic,
  title={Semantic understanding of scenes through the ade20k dataset},
  author={Zhou, Bolei and Zhao, Hang and Puig, Xavier and Fidler, Sanja and Barriuso, Adela and Torralba, Antonio},
  journal={arXiv preprint arXiv:1608.05442},
  year={2016}
}

Semantic segmentation task for ADE20k & cityscapse dataset, based on several models.

Related tags

Overview

semantic-segmentation-tensorflow

Models

...to be continue

Install

Inference

Arg list

Import module in your code:

Results

ade20k

cityscapes

Citation

Owner

HsuanKung Yang

Copy Paste positive polyp using poisson image blending for medical image segmentation

House-GAN++: Generative Adversarial Layout Refinement Network towards Intelligent Computational Agent for Professional Architects

Let's Git - Versionsverwaltung & Open Source Hausaufgabe

Planar Prior Assisted PatchMatch Multi-View Stereo

Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow

The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to provide participants with baseline systems for speech recognition and speaker diarization in conference scenario.

You Only Sample (Almost) Once: Linear Cost Self-Attention Via Bernoulli Sampling

Phylogeny Partners

Graph Analysis From Scratch

TalkingHead-1KH is a talking-head dataset consisting of YouTube videos

BASH - Biomechanical Animated Skinned Human

EmoTag helps you train emotion detection model for Chinese audios

Official Implement of CVPR 2021 paper “Cross-Modal Collaborative Representation Learning and a Large-Scale RGBT Benchmark for Crowd Counting”

dataset for ECCV 2020 "Motion Capture from Internet Videos"

This repository is an unoffical PyTorch implementation of Medical segmentation in 3D and 2D.

KSAI Lite is a deep learning inference framework of kingsoft, based on tensorflow lite

Use stochastic processes to generate samples and use them to train a fully-connected neural network based on Keras

Learning to Reconstruct 3D Manhattan Wireframes from a Single Image

Simple Tensorflow implementation of "Adaptive Convolutions for Structure-Aware Style Transfer" (CVPR 2021)

A Simulated Optimal Intrusion Response Game