Pixel-wise segmentation on VOC2012 dataset using pytorch.

Last update: Dec 30, 2022

Overview

PiWiSe

Pixel-wise segmentation on the VOC2012 dataset using pytorch.

For a more complete implementation of segmentation networks checkout semseg.

Note:

FCN differs from original implementation see this issue
SegNet does not match original paper performance see here
PSPNet misses "atrous convolution" (conv layers of ResNet101 should be amended to preserve image size)

Keeping this in mind feel free to PR. Thank you!

Setup

See dataset examples here.

Download

Download image archive and extract and do:

mkdir data
mv VOCdevkit/VOC2012/JPEGImages data/images
mv VOCdevkit/VOC2012/SegmentationClass data/classes
rm -rf VOCdevkit

Install

We recommend using pyenv:

pyenv virtualenv 3.6.0 piwise
pyenv activate piwise

then install requirements with pip install -r requirements.txt.

Usage

For latest documentation use:

python main.py --help

Supported model parameters are fcn8, fcn16, fcn32, unet, segnet1, segnet2, pspnet.

Training

If you want to have visualization open an extra tab with:

python -m visdom.server -port 5000

Train the SegNet model 30 epochs with cuda support, visualization and checkpoints every 100 steps:

python main.py --cuda --model segnet2 train --datadir data \
    --num-epochs 30 --num-workers 4 --batch-size 4 \
    --steps-plot 50 --steps-save 100

Evaluation

Then we want to do semantic segmentation on foo.jpg:

python main.py --model segnet2 --state segnet2-30-0 eval foo.jpg foo.png

The segmented class image can now be found at foo.png.

Results

These are some results based on segnet after 40 epoches. Set

loss_weights[0] = 1 / 1

to deal gracefully with the unbalanced problem.

Input	Output	Ground Truth

Pixel-wise segmentation on VOC2012 dataset using pytorch.

Related tags

Overview

PiWiSe

Setup

Download

Install

Usage

Training

Evaluation

Results

Owner

Bodo Kaiser

torchlm is aims to build a high level pipeline for face landmarks detection, it supports training, evaluating, exporting, inference(Python/C++) and 100+ data augmentations

The first machine learning framework that encourages learning ML concepts instead of memorizing class functions.

Implementation for Shape from Polarization for Complex Scenes in the Wild

GeneDisco is a benchmark suite for evaluating active learning algorithms for experimental design in drug discovery.

An unofficial implementation of "Unpaired Image Super-Resolution using Pseudo-Supervision." CVPR2020

PyTorch implementation of DUL (Data Uncertainty Learning in Face Recognition, CVPR2020)

Lua-parser-lark - An out-of-box Lua parser written in Lark

Pytorch implementation of PTNet for high-resolution and longitudinal infant MRI synthesis

SpeechBrain is an open-source and all-in-one speech toolkit based on PyTorch.

PyTorch implementation of Trust Region Policy Optimization

pytorch implementation for PointNet

Experiments for Fake News explainability project

In this tutorial, you will perform inference across 10 well-known pre-trained object detectors and fine-tune on a custom dataset. Design and train your own object detector.

Streamlit component for TensorBoard, TensorFlow's visualization toolkit

Experiments for distributed optimization algorithms

This is the code for the paper "Motion-Focused Contrastive Learning of Video Representations" (ICCV'21).

Generalized Jensen-Shannon Divergence Loss for Learning with Noisy Labels

CS5242_2021 - Neural Networks and Deep Learning, NUS CS5242, 2021

Official PyTorch Implementation of paper "NeLF: Neural Light-transport Field for Single Portrait View Synthesis and Relighting", EGSR 2021.

This tutorial aims to learn the basics of deep learning by hands, and master the basics through combination of lectures and exercises