Implementation of DropLoss for Long-Tail Instance Segmentation in Pytorch

Last update: Dec 02, 2022

Overview

[AAAI 2021]DropLoss for Long-Tail Instance Segmentation

[AAAI 2021] DropLoss for Long-Tail Instance Segmentation
Ting-I Hsieh*, Esther Robb*, Hwann-Tzong Chen, Jia-Bin Huang.
Association for the Advancement of Artificial Intelligence (AAAI), 2021

Figure: Measuring the performance tradeoff. Comparison between rare, common, and frequent categories AP for baselines and our method. We visualize the tradeoff for ‘common vs. frequent’ and ‘rare vs. frequent’as a Pareto frontier, where the top-right position indicates an ideal tradeoff between objectives. DropLoss achieves an improved tradeoff between object categories, resulting in higher overall AP.

This project is a pytorch implementation of DropLoss for Long-Tail Instance Segmentation. DropLoss improves long-tail instance segmentation by adaptively removing discouraging gradients to infrequent classes. A majority of the code is modified from facebookresearch/detectron2 and tztztztztz/eql.detectron2.

Progress

Training code.
Evaluation code.
LVIS v1.0 datasets.
Provide checkpoint model.

Installation

Requirements

Linux or macOS with Python = 3.7
PyTorch = 1.4 and torchvision that matches the PyTorch installation. Install them together at pytorch.org to make sure of this
OpenCV (optional but needed for demos and visualization)

Build Detectron2 from Source

gcc & g++ ≥ 5 are required. ninja is recommended for faster build.

After installing them, run:

python -m pip install 'git+https://github.com/facebookresearch/detectron2.git'
# (add --user if you don't have permission)

# Or, to install it from a local clone:
git clone https://github.com/facebookresearch/detectron2.git
python -m pip install -e detectron2


# Or if you are on macOS
CC=clang CXX=clang++ ARCHFLAGS="-arch x86_64" python -m pip install ......

Remove the latest fvcore package and install an older version:

pip uninstall fvcore
pip install fvcore==0.1.1.post200513

LVIS Dataset

Following the instructions of README.md to set up the LVIS dataset.

Training

To train a model with 8 GPUs run:

cd /path/to/detectron2/projects/DropLoss
python train_net.py --config-file configs/droploss_mask_rcnn_R_50_FPN_1x.yaml --num-gpus 8

Evaluation

Model evaluation can be done similarly:

cd /path/to/detectron2/projects/DropLoss
python train_net.py --config-file configs/droploss_mask_rcnn_R_50_FPN_1x.yaml --eval-only MODEL.WEIGHTS /path/to/model_checkpoint

Citing DropLoss

If you use DropLoss, please use the following BibTeX entry.

@inproceedings{DBLP:conf/aaai/Ting21,
  author 	= {Hsieh, Ting-I and Esther Robb and Chen, Hwann-Tzong and Huang, Jia-Bin},
  title     = {DropLoss for Long-Tail Instance Segmentation},
  booktitle = {Proceedings of the Workshop on Artificial Intelligence Safety 2021
               (SafeAI 2021) co-located with the Thirty-Fifth {AAAI} Conference on
               Artificial Intelligence {(AAAI} 2021), Virtual, February 8, 2021},
  year      = {2021}
  }

Implementation of DropLoss for Long-Tail Instance Segmentation in Pytorch

Related tags

Overview

[AAAI 2021]DropLoss for Long-Tail Instance Segmentation

Progress

Installation

Requirements

Build Detectron2 from Source

LVIS Dataset

Training

Evaluation

Citing DropLoss

Owner

Tim

TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks

TensorFlow 2 AI/ML library wrapper for openFrameworks

MLPs for Vision and Langauge Modeling (Coming Soon)

Self-Supervised Learning with Data Augmentations Provably Isolates Content from Style

TLXZoo - Pre-trained models based on TensorLayerX

[WACV21] Code for our paper: Samuel, Atzmon and Chechik, "From Generalized zero-shot learning to long-tail with class descriptors"

BOVText: A Large-Scale, Multidimensional Multilingual Dataset for Video Text Spotting

Original Implementation of Prompt Tuning from Lester, et al, 2021

VOGUE: Try-On by StyleGAN Interpolation Optimization

Pytorch implementation of NeurIPS 2021 paper: Geometry Processing with Neural Fields.

Code repository for the paper: Hierarchical Kinematic Probability Distributions for 3D Human Shape and Pose Estimation from Images in the Wild (ICCV 2021)

Personalized Transfer of User Preferences for Cross-domain Recommendation (PTUPCDR)

Implementation of Analyzing and Improving the Image Quality of StyleGAN (StyleGAN 2) in PyTorch

Official implementation of SIGIR'2021 paper: "Sequential Recommendation with Graph Neural Networks".

Benchmark for the generalization of 3D machine learning models across different remeshing/samplings of a surface.

Code for our paper "Sematic Representation for Dialogue Modeling" in ACL2021

Implicit Graph Neural Networks

This repository is based on Ultralytics/yolov5, with adjustments to enable polygon prediction boxes.

PyTorch Implementation of Temporal Output Discrepancy for Active Learning, ICCV 2021

PyTorch implementation of "PatchGame: Learning to Signal Mid-level Patches in Referential Games" to appear in NeurIPS 2021