Deep Watershed Transform for Instance Segmentation

Last update: Nov 20, 2022

Related tags

Overview

Deep Watershed Transform

Performs instance level segmentation detailed in the following paper:

Min Bai and Raquel Urtasun, Deep Watershed Transformation for Instance Segmentation, in CVPR 2017. Accessible at https://arxiv.org/abs/1611.08303.

This page is still under construction.

Dependencies

Developed and tested on Ubuntu 14.04 and 16.04.

TensorFlow www.tensorflow.org
Numpy, Scipy, and Skimage (sudo apt-get install python-numpy python-scipy python-skimage)

Inputs

Cityscapes images (www.cityscapes-dataset.com).
Semantic Segmentation for input images. In our case, we used the output from PSPNet (by H. Zhao et al. https://github.com/hszhao/PSPNet). These are uint8 images with pixel-wise semantic labels encoded with 'trainIDs' defined by Cityscapes. For more information, visit https://github.com/mcordts/cityscapesScripts/blob/master/cityscapesscripts/helpers/labels.py

Outputs

The model produces pixel-wise instance labels as a uint16 image with the same formatting as the Cityscapes instance segmentation challenge ground truth. In particular, each pixel is labeled as 'id' * 1000 + instance_id, where 'id' is as defined by Cityscapes (for more information, consult labels.py in the above link), and instance_id is an integer indexing the object instance.

Testing the Model

Clone repository into dwt/.
Download the model from www.cs.toronto.edu/~mbai/dwt_cityscapes_pspnet.mat and place into the "dwt/model" directory.
run "cd E2E"
run "python main.py"
The results will be available in "dwt/example/output".

Training the Model

Will be available soon.

Deep Watershed Transform for Instance Segmentation

Related tags

Overview

Deep Watershed Transform

Dependencies

Inputs

Outputs

Testing the Model

Training the Model

Owner

Official Chainer implementation of GP-GAN: Towards Realistic High-Resolution Image Blending (ACMMM 2019, oral)

COPA-SSE contains crowdsourced explanations for the Balanced COPA dataset

A collection of easy-to-use, ready-to-use, interesting deep neural network models

Implementation of Hire-MLP: Vision MLP via Hierarchical Rearrangement and An Image Patch is a Wave: Phase-Aware Vision MLP.

交互式标注软件，暂定名 iann

C3DPO - Canonical 3D Pose Networks for Non-rigid Structure From Motion.

Global Rhythm Style Transfer Without Text Transcriptions

Change Detection in SAR Images Based on Multiscale Capsule Network

Offical implementation of Shunted Self-Attention via Multi-Scale Token Aggregation

an Evolutionary Algorithm assisted GAN

DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.

PlenOctrees: NeRF-SH Training & Conversion

YOLOv4-v3 Training Automation API for Linux

A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.

HW3 ― GAN, ACGAN and UDA

Official repository for HOTR: End-to-End Human-Object Interaction Detection with Transformers (CVPR'21, Oral Presentation)

Learning-Augmented Dynamic Power Management

A method that utilized Generative Adversarial Network (GAN) to interpret the black-box deep image classifier models by PyTorch.

From Fidelity to Perceptual Quality: A Semi-Supervised Approach for Low-Light Image Enhancement (CVPR'2020)