FCN-semantic-segmentation

Simple end-to-end semantic segmentation using fully convolutional networks [1]. Takes a pretrained 34-layer ResNet [2], removes the fully connected layers, and adds transposed convolution layers with skip connections from lower layers. Initialises upsampling convolutions with bilinear interpolation filters and zeros the final (classification) layer.

Uses an independent cross-entropy loss per class. Trained with SGD with momentum, plus weight decay only on convolutional weights. Calculates and plots class-wise and mean intersection-over-union. Checkpoints the network every epoch.

Note: This code does not achieve great results (achieves ~40 IoU fairly quickly, but converges there). Contributions to fix this are welcome! The goal of this repo is to provide strong, simple and efficient baselines for semantic segmentation using the FCN method, so this shouldn't be restricted to using ResNet 34 etc.

Requirements

Instructions

Install all of the required software. To feasibly run the training, CUDA is needed. The crop size and batch size can be tailored to your GPU memory (the default crop and batch sizes use ~10GB of GPU RAM).
Register on the Cityscapes website to access the dataset.
Download and extract the training/validation RGB data (leftImg8bit_trainvaltest) and ground truth data (gtFine_trainvaltest).
Run python main.py <options>.

First a Dataset object is set up, returning the RGB inputs, one-hot targets (for independent classification) and label targets. During training, the images are randomly cropped and horizontally flipped. Testing calculates IoU scores and produces a subset of coloured predictions that match the coloured ground truth.

References

[1] Fully convolutional networks for semantic segmentation
[2] Deep Residual Learning for Image Recognition

Fully convolutional networks for semantic segmentation

Related tags

Overview

FCN-semantic-segmentation

Requirements

Instructions

References

Owner

Kai Arulkumaran

[cvpr22] Perturbed and Strict Mean Teachers for Semi-supervised Semantic Segmentation

A simplistic and efficient pure-python neural network library from Phys Whiz with CPU and GPU support.

[3DV 2020] PeeledHuman: Robust Shape Representation for Textured 3D Human Body Reconstruction

Official Repository for Machine Learning class - Physics Without Frontiers 2021

Additional environments compatible with OpenAI gym

Non-Imaging Transient Reconstruction And TEmporal Search (NITRATES)

A repository with exploration into using transformers to predict DNA ↔ transcription factor binding

This repository contains the code for the CVPR 2021 paper "GIRAFFE: Representing Scenes as Compositional Generative Neural Feature Fields"

Revisiting Video Saliency: A Large-scale Benchmark and a New Model (CVPR18, PAMI19)

Real-Time Multi-Contact Model Predictive Control via ADMM

PyTorch code for our paper "Image Super-Resolution with Non-Local Sparse Attention" (CVPR2021).

Voice control for Garry's Mod

https://arxiv.org/abs/2102.11005

FusionNet: A deep fully residual convolutional neural network for image segmentation in connectomics

project page for VinVL

RMNA: A Neighbor Aggregation-Based Knowledge Graph Representation Learning Model Using Rule Mining

Tensorflow implementation of "Learning Deep Features for Discriminative Localization"

Official implementation for paper: A Latent Transformer for Disentangled Face Editing in Images and Videos.

RETRO-pytorch - Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch

CoSMA: Convolutional Semi-Regular Mesh Autoencoder. From Paper "Mesh Convolutional Autoencoder for Semi-Regular Meshes of Different Sizes"