A TensorFlow implementation of FCN-8s

Last update: Aug 08, 2022

FCN-8s implementation in TensorFlow

Overview
Examples and demo video
Dependencies
How to use it
Download pre-trained VGG-16

Overview

This is a TensorFlow implementation of the FCN-8s model architecture for semantic image segmentation introduced by Shelhamer et al. in the paper Fully Convolutional Networks for Semantic Segmentation.

This repository only contains the 'all-at-once' version of the FCN-8s model, which converges significantly faster than the version trained in stages. A convolutionalized VGG-16 model trained on ImageNet classification is provided and serves as the encoder of the FCN-8s. Sufficient documentation and a tutorial on how to train, evaluate and use the model for prediction are also provided. Some useful TensorBoard summaries can be recorded out of the box.
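To illustrate what "convolutionalized" means here, the following is a minimal NumPy sketch, not code from this repository. It assumes a dense layer whose input was flattened in (height, width, channels) order, and shows that its weights can be reshaped into a convolution kernel that gives the same result on the original input size and a spatial grid of outputs on larger inputs. The helper names `dense_to_conv_kernel` and `conv_valid` and the toy dimensions are made up for this example.

```python
import numpy as np


def dense_to_conv_kernel(dense_weights, height, width, channels):
    """Reshape dense weights (height*width*channels, units) into a conv kernel.

    Assumption: the dense layer's input was flattened in
    (height, width, channels) order; other orders need a transpose first.
    """
    units = dense_weights.shape[1]
    return dense_weights.reshape(height, width, channels, units)


def conv_valid(feature_map, kernel):
    """Naive stride-1 'valid' cross-correlation.

    feature_map: (H, W, C), kernel: (kh, kw, C, units) -> (H-kh+1, W-kw+1, units)
    """
    kh, kw, _, units = kernel.shape
    out_h = feature_map.shape[0] - kh + 1
    out_w = feature_map.shape[1] - kw + 1
    out = np.empty((out_h, out_w, units))
    for i in range(out_h):
        for j in range(out_w):
            patch = feature_map[i:i + kh, j:j + kw, :]
            # Contract the patch against the kernel over height, width, channels.
            out[i, j] = np.tensordot(patch, kernel, axes=([0, 1, 2], [0, 1, 2]))
    return out


# Toy sizes for illustration only (much smaller than VGG-16).
rng = np.random.default_rng(0)
h, w, c, units = 3, 3, 4, 5
dense_w = rng.normal(size=(h * w * c, units))
kernel = dense_to_conv_kernel(dense_w, h, w, c)

# On an input of the original size, both layers agree.
small = rng.normal(size=(h, w, c))
dense_out = small.reshape(-1) @ dense_w
conv_out = conv_valid(small, kernel)

# On a larger input, the conv layer yields one output vector per location.
large = rng.normal(size=(5, 6, c))
large_out = conv_valid(large, kernel)
```

Because the convolutional form accepts inputs of any spatial size, the classifier can act as a dense-prediction encoder, which is what the FCN family relies on.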

Examples and demo video

Below are some prediction examples of the model trained on the Cityscapes dataset for 13,000 steps at batch size 16, at which point the model achieves a mean IoU of 38.2% on the validation dataset. This is far from convergence, of course; the purpose of these examples is only to demonstrate that the code works and that the model learns. You can watch the model in action on the Cityscapes demo videos here.
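For readers unfamiliar with the metric, here is a minimal NumPy sketch of how a mean IoU (intersection over union, averaged over classes) can be computed from label maps. It is not the repository's evaluation code; the function names are hypothetical, and it assumes integer class IDs, predictions within `[0, num_classes)`, and that labels outside that range should be ignored.

```python
import numpy as np


def confusion_matrix(labels, predictions, num_classes):
    """Count (true class, predicted class) pairs over all pixels.

    Assumption: labels outside [0, num_classes) mark pixels to ignore.
    """
    labels = np.asarray(labels).reshape(-1)
    predictions = np.asarray(predictions).reshape(-1)
    valid = (labels >= 0) & (labels < num_classes)
    index = num_classes * labels[valid] + predictions[valid]
    counts = np.bincount(index, minlength=num_classes ** 2)
    return counts.reshape(num_classes, num_classes)


def mean_iou(confusion):
    """Average per-class IoU over classes that occur in labels or predictions."""
    true_positives = np.diag(confusion).astype(np.float64)
    # Union = pixels labeled as the class + pixels predicted as it - overlap.
    union = confusion.sum(axis=0) + confusion.sum(axis=1) - true_positives
    present = union > 0
    if not present.any():
        return float("nan")
    return float(np.mean(true_positives[present] / union[present]))


# Tiny 2x3 example with three classes.
labels = [[0, 0, 1],
          [1, 2, 2]]
predictions = [[0, 1, 1],
               [1, 2, 0]]
cm = confusion_matrix(labels, predictions, num_classes=3)
score = mean_iou(cm)  # per-class IoUs are 1/3, 2/3 and 1/2, so the mean is 0.5
```

Accumulating one confusion matrix over the whole validation set and computing the IoU once at the end gives a dataset-level score, which generally differs from averaging per-image scores.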

Dependencies

Python 3.x
TensorFlow 1.x
NumPy
SciPy
OpenCV (for data augmentation)
tqdm

How to use it

fcn8s_tutorial.ipynb explains how to train and evaluate the model and how to make and visualize predictions.

Download pre-trained VGG-16

You can download the pre-trained, convolutionalized VGG-16 model here.

Owner

Pierluigi Ferrari
