Semi-Supervised Semantic Segmentation via Adaptive Equalization Learning, NeurIPS 2021 (Spotlight)

Last update: Dec 12, 2022

Overview

Semi-Supervised Semantic Segmentation via Adaptive Equalization Learning, NeurIPS 2021 (Spotlight)

Abstract

Due to the limited and even imbalanced data, semi-supervised semantic segmentation tends to have poor performance on some certain categories, e.g., tailed categories in Cityscapes dataset which exhibits a long-tailed label distribution. Existing approaches almost all neglect this problem, and treat categories equally. Some popular approaches such as consistency regularization or pseudo-labeling may even harm the learning of under-performing categories, that the predictions or pseudo labels of these categories could be too inaccurate to guide the learning on the unlabeled data. In this paper, we look into this problem, and propose a novel framework for semi-supervised semantic segmentation, named adaptive equalization learning (AEL). AEL adaptively balances the training of well and badly performed categories, with a confidence bank to dynamically track category-wise performance during training. The confidence bank is leveraged as an indicator to tilt training towards under-performing categories, instantiated in three strategies: 1) adaptive Copy-Paste and CutMix data augmentation approaches which give more chance for under-performing categories to be copied or cut; 2) an adaptive data sampling approach to encourage pixels from under-performing category to be sampled; 3) a simple yet effective re-weighting method to alleviate the training noise raised by pseudo-labeling. Experimentally, AEL outperforms the state-of-the-art methods by a large margin on the Cityscapes and Pascal VOC benchmarks under various data partition protocols. For more details, please refer to our NeurIPS paper (arxiv).

Installation

Check INSTALL.md for installation instructions.

Training and Evaluation

For example, perform training and evaluation with 1/2 data parttition on Cityscapes dataset.

cd experiments/cityscapes_2
bash train.sh

For other partition protocols, change n_sup in config.yaml.

TODO

Other SOTA semi-supervised segmentation methods

Semi-Supervised Semantic Segmentation via Adaptive Equalization Learning, NeurIPS 2021 (Spotlight)

Related tags

Overview

Semi-Supervised Semantic Segmentation via Adaptive Equalization Learning, NeurIPS 2021 (Spotlight)

Abstract

Installation

Training and Evaluation

TODO

Owner

Hanzhe Hu

PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INTERSPEECH 2020)

Code for the paper One Thing One Click: A Self-Training Approach for Weakly Supervised 3D Semantic Segmentation, CVPR 2021.

DeLag: Detecting Latency Degradation Patterns in Service-based Systems

PerfFuzz: Automatically Generate Pathological Inputs for C/C++ programs

Milano is a tool for automating hyper-parameters search for your models on a backend of your choice.

WHENet - ONNX, OpenVINO, TFLite, TensorRT, EdgeTPU, CoreML, TFJS, YOLOv4/YOLOv4-tiny-3L

AgML is a comprehensive library for agricultural machine learning

The project of phase's key role in complex and real NN

Single cell current best practices tutorial case study for the paper:Luecken and Theis, "Current best practices in single-cell RNA-seq analysis: a tutorial"

RLMeta is a light-weight flexible framework for Distributed Reinforcement Learning Research.

The implementation of our CIKM 2021 paper titled as: "Cross-Market Product Recommendation"

A library of multi-agent reinforcement learning components and systems

A simple Python library for stochastic graphical ecological models

RM Operation can equivalently convert ResNet to VGG, which is better for pruning; and can help RepVGG perform better when the depth is large.

The tl;dr on a few notable transformer/language model papers + other papers (alignment, memorization, etc).

Decensoring Hentai with Deep Neural Networks. Formerly named DeepMindBreak.

Explaining neural decisions contrastively to alternative decisions.

Learning to Disambiguate Strongly Interacting Hands via Probabilistic Per-Pixel Part Segmentation [3DV 2021 Oral]

Sound Source Localization for AI Grand Challenge 2021

This repository contains the code for EMNLP-2021 paper "Word-Level Coreference Resolution"