A higher performance pytorch implementation of DeepLab V3 Plus(DeepLab v3+)

Last update: Nov 22, 2022

Related tags

Overview

A Higher Performance Pytorch Implementation of DeepLab V3 Plus

Introduction

This repo is an (re-)implementation of Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation in PyTorch for semantic image segmentation on the PASCAL VOC dataset. And this repo has a higher mIoU of 79.19% than the result of paper which is 78.85%.

Requirements

Python(3.6) and Pytorch(0.4.1) is necessary before running the scripts. To install the required python packages(expect PyTorch), run

pip install -r requirements.txt

Datasets

To train and validate the network, this repo use the augmented PASCAL VOC 2012 dataset which contains 10582 images for training and 1449 images for validation. To use the dataset, you can download the PASCAL VOC training/validation data (2GB tar file) here and download the SegmentationClassAug from dropbox or Baidu Netdisk

Training

Before training, you should clone this repo:

git clone git@github.com:hualin95/Deeplab-v3plus.git

You can begin training by running the train.py.

#training
cd Deeplab-v3plus-master/tools/   
python train.py

You are expected to achieve PA:94.77%, MPA:88.48%, MIoU:79.19%, FWIoU:90.53% on the validation.

#Monitoring
tensorboard --logdir=runs/ --port=80

Performance

VOC2012: after 30k iterations with a batch size of 16.

Backbone	train OS	eval OS	MS	mIoU paper	mIoU repo
Resnet101	16	16	No	78.85%	79.19%

TODO

Resnet as Network Backbone
Implement depthwise separable convolutions
Multi-GPU support
Model pretrained on MS-COCO
Xception as Network Backbone

A higher performance pytorch implementation of DeepLab V3 Plus(DeepLab v3+)

Related tags

Overview

A Higher Performance Pytorch Implementation of DeepLab V3 Plus

Introduction

Requirements

Datasets

Training

Performance

TODO

Owner

linhua

EquiBind: Geometric Deep Learning for Drug Binding Structure Prediction

Code for "Neural Body: Implicit Neural Representations with Structured Latent Codes for Novel View Synthesis of Dynamic Humans" CVPR 2021 best paper candidate

Markov Attention Models

An Implementation of SiameseRPN with Feature Pyramid Networks

An implementation of the BADGE batch active learning algorithm.

MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation

Azion the best solution of Edge Computing in the world.

A Genetic Programming platform for Python with TensorFlow for wicked-fast CPU and GPU support.

This repository is an implementation of paper : Improving the Training of Graph Neural Networks with Consistency Regularization

[CVPR 21] Vectorization and Rasterization: Self-Supervised Learning for Sketch and Handwriting, IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2021.

G-NIA model from "Single Node Injection Attack against Graph Neural Networks" (CIKM 2021)

Like Dirt-Samples, but cleaned up

Visualizing lattice vibration information from phonon dispersion to atoms (For GPUMD)

The dataset of tweets pulling from Twitters with keyword: Hydroxychloroquine, location: US, Time: 2020

Open-source Monocular Python HawkEye for Tennis

This repository provides code for "On Interaction Between Augmentations and Corruptions in Natural Corruption Robustness".

RuDOLPH: One Hyper-Modal Transformer can be creative as DALL-E and smart as CLIP

Light-SERNet: A lightweight fully convolutional neural network for speech emotion recognition

It's final year project of Diploma Engineering. This project is based on Computer Vision.

Official Pytorch implementation of the paper "Action-Conditioned 3D Human Motion Synthesis with Transformer VAE", ICCV 2021