This repository contains code for the paper "Disentangling Label Distribution for Long-tailed Visual Recognition", published at CVPR' 2021

Last update: Oct 18, 2022

Related tags

Deep Learning LADE

Overview

Disentangling Label Distribution for Long-tailed Visual Recognition (CVPR 2021)

Arxiv link
Blog post
This codebase is built on Causal Norm.

Install

conda create -n longtail pip python=3.7 -y
source activate longtail
conda install pytorch torchvision cudatoolkit=10.1 -c pytorch
pip install pyyaml tqdm matplotlib sklearn h5py tensorboard

Training

Preliminaries

Download pretrained caffe resnet152 model for Places-LT: please refer to link.
Prepare dataset: CIFAR-100, Places-LT, ImageNet-LT, iNaturalist 2018
- Please download those datasets following Decoupling.

CIFAR-100 training

For CIFAR-100 with imbalance ratio 0.01, using LADE:

python main.py --seed 1 --cfg config/CIFAR100_LT/lade.yaml --exp_name lade2021/cifar100_imb0.01_lade --cifar_imb_ratio 0.01 --remine_lambda 0.01 --alpha 0.1 --gpu 0

Places-LT training

For PC Softmax:

python main.py --seed 1 --cfg config/Places_LT/ce.yaml --exp_name lade2021/places_pc_softmax --lr 0.05 --gpu 0,1,2,3

For LADE:

python main.py --seed 1 --cfg config/Places_LT/lade.yaml --exp_name lade2021/places_lade --lr 0.05 --remine_lambda 0.1 --alpha 0.005 --gpu 0,1,2,3

ImageNet-LT training

For LADE:

python main.py --seed 1 --cfg config/ImageNet_LT/lade.yaml  --exp_name lade2021/imagenet_lade --lr 0.05 --remine_lambda 0.5 --alpha 0.05 --gpu 0,1,2,3

iNaturalist18 training

For LADE:

python main.py --seed 1 --cfg ./config/iNaturalist18/lade.yaml --exp_name lade2021/inat_lade --lr 0.1 --alpha 0.05 --gpu 0,1,2,3

Evaluate on shifted test set & Confidence calibration

For Imagenet (Section 4.3, 4.4):

./notebooks/imagenet-shift-calib.ipynb

For CIFAR-100 (Supplementary material):

./notebooks/cifar100-shift-calib.ipynb

License

The use of this software is released under BSD-3.

Citation

If you find our paper or this project helps your research, please kindly consider citing our paper in your publications.

@article{hong2020disentangling,
  title={Disentangling Label Distribution for Long-tailed Visual Recognition},
  author={Hong, Youngkyu and Han, Seungju and Choi, Kwanghee and Seo, Seokjun and Kim, Beomsu and Chang, Buru},
  journal={arXiv preprint arXiv:2012.00321},
  year={2020}
}

This repository contains code for the paper "Disentangling Label Distribution for Long-tailed Visual Recognition", published at CVPR' 2021

Related tags

Overview

Disentangling Label Distribution for Long-tailed Visual Recognition (CVPR 2021)

Install

Training

Preliminaries

CIFAR-100 training

Places-LT training

ImageNet-LT training

iNaturalist18 training

Evaluate on shifted test set & Confidence calibration

License

Citation

Owner

Hyperconnect

A check for whether the dependency jobs are all green.

BERT model training impelmentation using 1024 A100 GPUs for MLPerf Training v1.1

Jittor implementation of Recursive-NeRF: An Efficient and Dynamically Growing NeRF

The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"

Safe Policy Optimization with Local Features

Blind visual quality assessment on 360° Video based on progressive learning

The implementation of "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement"

Trading Gym is an open source project for the development of reinforcement learning algorithms in the context of trading.

An image processing project uses Viola-jones technique to detect faces and then use SIFT algorithm for recognition.

OpenCVのGrabCut()を利用したセマンティックセグメンテーション向けアノテーションツール(Annotation tool using GrabCut() of OpenCV. It can be used to create datasets for semantic segmentation.)

Hepsiburada - Hepsiburada Urun Bilgisi Cekme

Pi-NAS: Improving Neural Architecture Search by Reducing Supernet Training Consistency Shift (ICCV 2021)

The 1st Place Solution of the Facebook AI Image Similarity Challenge (ISC21) : Descriptor Track.

Learning to Reconstruct 3D Manhattan Wireframes from a Single Image

A 2D Visual Localization Framework based on Essential Matrices [ICRA2020]

Convert Python 3 code to CUDA code.

Code of the paper "Multi-Task Meta-Learning Modification with Stochastic Approximation".

Fine-grained Control of Image Caption Generation with Abstract Scene Graphs

Numenta published papers code and data

A map update dataset and benchmark