Normalization Calibration (NorCal) for Long-Tailed Object Detection and Instance Segmentation

Last update: Dec 25, 2022

Related tags

Deep Learning NorCal

Overview

NorCal

Normalization Calibration (NorCal) for Long-Tailed Object Detection and Instance Segmentation

On Model Calibration for Long-Tailed Object Detection and Instance Segmentation.

Advances in Neural Information Processing Systems (NeurIPS), 2021.

Tai-Yu Pan*, Cheng Zhang*, Yandong Li, Hexiang Hu, Dong Xuan, Soravit Changpinyo, Boqing Gong, Wei-Lun Chao.

Introduction

Vanilla models for object detection and instance segmentation suffer from the heavy bias toward detecting frequent objects in the long-tailed setting. Existing methods address this issue mostly during training, e.g., by re-sampling or re-weighting.

In this paper, we investigate a largely overlooked approach -- post-processing calibration of confidence scores. We propose NorCal, Normalized Calibration for long-tailed object detection and instance segmentation, a simple and straightforward recipe that reweighs the predicted scores of each class by its training sample size. We show that separately handling the background class and normalizing the scores over classes for each proposal are keys to achieving superior performance. On the LVIS dataset, NorCal can effectively improve nearly all the baseline models not only on rare classes but also on common and frequent classes. Finally, we conduct extensive analysis and ablation studies to offer insights into various modeling choices and mechanisms of our approach.

Installation

Install Detectron2 following the instructions.

Evaluation

Model evaluation can be done similarly:

cd /path/to/detectron2/projects/NorCal
python train_net.py --config-file configs/lvis_v0.5_mask_rcnn_R_50_FPN.yaml --eval-only MODEL.WEIGHTS /path/to/model_checkpoint TEST.CALIBRATION.GAMMA gamma

Citation

Please cite with the following bibtex if you find it useful.

@inproceedings{pan2021norcal,
  title={On Model Calibration for Long-Tailed Object Detection and Instance Segmentation},
  author={Pan, Tai-Yu and Zhang, Cheng and Li, Yandong and Hu, Hexiang and Xuan, Dong and Changpinyo, Soravit and Gong, Boqing and Chao, Wei-Lun},
  booktitle = {NeurIPS},
  year={2021}
}

Normalization Calibration (NorCal) for Long-Tailed Object Detection and Instance Segmentation

Related tags

Overview

NorCal

Normalization Calibration (NorCal) for Long-Tailed Object Detection and Instance Segmentation

Introduction

Installation

Evaluation

Citation

Owner

Tai-Yu (Daniel) Pan

Official PyTorch implementation of PS-KD

Global-Local Context Network for Person Search

Representing Long-Range Context for Graph Neural Networks with Global Attention

Dynamical Wasserstein Barycenters for Time Series Modeling

This repository provides some of the code implemented and the data used for the work proposed in "A Cluster-Based Trip Prediction Graph Neural Network Model for Bike Sharing Systems".

Implementation of MA-Trace - a general-purpose multi-agent RL algorithm for cooperative environments.

nextPARS, a novel Illumina-based implementation of in-vitro parallel probing of RNA structures.

Video2x - A lossless video/GIF/image upscaler achieved with waifu2x, Anime4K, SRMD and RealSR.

An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.

A Closer Look at Reference Learning for Fourier Phase Retrieval

[NeurIPS 2021]: Are Transformers More Robust Than CNNs? (Pytorch implementation & checkpoints)

Algo-burn - Script to configure an Algorand address as a "burn" address for one or more ASA tokens

Bi-level feature alignment for versatile image translation and manipulation (Under submission of TPAMI)

CDGAN: Cyclic Discriminative Generative Adversarial Networks for Image-to-Image Transformation

A python package to perform same transformation to coco-annotation as performed on the image.

Scientific Computation Methods in C and Python (Open for Hacktoberfest 2021)

Multi-Modal Machine Learning toolkit based on PaddlePaddle.

GB-CosFace: Rethinking Softmax-based Face Recognition from the Perspective of Open Set Classification

Graph Convolutional Neural Networks with Data-driven Graph Filter (GCNN-DDGF)

AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposals Generation