Per-Pixel Classification is Not All You Need for Semantic Segmentation

Last update: Jan 08, 2023

Related tags

Deep Learning MaskFormer

Overview

MaskFormer: Per-Pixel Classification is Not All You Need for Semantic Segmentation

Bowen Cheng, Alexander G. Schwing, Alexander Kirillov

[arXiv] [Project] [BibTeX]

Features

Better results while being more efficient.
Unified view of semantic- and instance-level segmentation tasks.
Support major semantic segmentation datasets: ADE20K, Cityscapes, COCO-Stuff, Mapillary Vistas.
Support ALL Detectron2 models.

Installation

See installation instructions.

Getting Started

See Preparing Datasets for MaskFormer.

See Getting Started with MaskFormer.

Model Zoo and Baselines

We provide a large set of baseline results and trained models available for download in the MaskFormer Model Zoo.

License

Shield:

The majority of MaskFormer is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

However portions of the project are available under separate license terms: Swin-Transformer-Semantic-Segmentation is licensed under the MIT license.

Citing MaskFormer

If you use MaskFormer in your research or wish to refer to the baseline results published in the Model Zoo, please use the following BibTeX entry.

@article{cheng2021maskformer,
  title={Per-Pixel Classification is Not All You Need for Semantic Segmentation},
  author={Bowen Cheng and Alexander G. Schwing and Alexander Kirillov},
  journal={arXiv},
  year={2021}
}

Per-Pixel Classification is Not All You Need for Semantic Segmentation

Related tags

Overview

MaskFormer: Per-Pixel Classification is Not All You Need for Semantic Segmentation

Features

Installation

Getting Started

Model Zoo and Baselines

License

Citing MaskFormer

Owner

Facebook Research

Predict bus arrival time using VertexAI and Nvidia's Jetson Nano

DeepMReye: magnetic resonance-based eye tracking using deep neural networks

B2EA: An Evolutionary Algorithm Assisted by Two Bayesian Optimization Modules for Neural Architecture Search

Survival analysis in Python

This is an official implementation for "PlaneRecNet".

YoHa - A practical hand tracking engine.

git《Learning Pairwise Inter-Plane Relations for Piecewise Planar Reconstruction》(ECCV 2020) GitHub:

The code of Zero-shot learning for low-light image enhancement based on dual iteration

Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch

CVPR 2021: "Generating Diverse Structure for Image Inpainting With Hierarchical VQ-VAE"

Research code of ICCV 2021 paper "Mesh Graphormer"

Image Classification - A research on image classification and auto insurance claim prediction, a systematic experiments on modeling techniques and approaches

This is the official repository for our paper: ''Pruning Self-attentions into Convolutional Layers in Single Path''.

Official Pytorch Implementation for Splicing ViT Features for Semantic Appearance Transfer presenting Splice

Linear image-to-image translation

This repository is for DSA and CP scripts for reference.

Autonomous Robots Kalman Filters

Neon-erc20-example - Example of creating SPL token and wrapping it with ERC20 interface in Neon EVM

Repository of best practices for deep learning in Julia, inspired by fastai

(to be released) [NeurIPS'21] Transformers Generalize DeepSets and Can be Extended to Graphs and Hypergraphs