[TPAMI 2021] iOD: Incremental Object Detection via Meta-Learning

Last update: Jan 04, 2023

Overview

Incremental Object Detection via Meta-Learning

To appear in an upcoming issue of the IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)

arXiv paper: https://arxiv.org/abs/2003.08798

Abstract

In a real-world setting, object instances from new classes can be continuously encountered by object detectors. When existing object detectors are applied to such scenarios, their performance on old classes deteriorates significantly. A few efforts have been reported to address this limitation, all of which apply variants of knowledge distillation to avoid catastrophic forgetting.

We note that although distillation helps to retain previous learning, it obstructs fast adaptability to new tasks, which is a critical requirement for incremental learning. In this pursuit, we propose a meta-learning approach that learns to reshape model gradients, such that information across incremental tasks is optimally shared. This ensures a seamless information transfer via a meta-learned gradient preconditioning that minimizes forgetting and maximizes knowledge transfer. In comparison to existing meta-learning methods, our approach is task-agnostic, allows incremental addition of new-classes and scales to high-capacity models for object detection.

We evaluate our approach on a variety of incremental learning settings defined on PASCAL-VOC and MS COCO datasets, where our approach performs favourably well against state-of-the-art methods.

Installation and setup

Install the Detectron2 library that is packages along with this code base. See INSTALL.md.
Download and extract Pascal VOC 2007 to ./datasets/VOC2007/
Use the starter script: run.sh

Trained Models and Logs

Setting	Reported mAP	Reproduced mAP	Commands	Models and logs
19+1	70.2	70.4	run.sh	Google Drive
15+5	67.8	69.6	run.sh	Google Drive
10+10	66.3	67.3	run.sh	Google Drive

Configurations with which the above results were reproduced:

Python version: 3.6.7
PyTorch version: 1.3.0
CUDA version: 11.0
GPUs: 4 x NVIDIA GTX 1080-ti

Acknowledgement

The code is build on top of Detectron2 library.

Citation

If you find our research useful, please consider citing us:

@article{joseph2021incremental,
  title={Incremental object detection via meta-learning},
  author={Joseph, KJ and Rajasegaran, Jathushan and Khan, Salman and Khan, Fahad Shahbaz and Balasubramanian, Vineeth},
  journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
  year={2021}
}

[TPAMI 2021] iOD: Incremental Object Detection via Meta-Learning

Related tags

Overview

Incremental Object Detection via Meta-Learning

To appear in an upcoming issue of the IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)

Abstract

Installation and setup

Trained Models and Logs

Configurations with which the above results were reproduced:

Acknowledgement

Citation

Owner

Joseph K J

Dark Finix: All in one hacking framework with almost 100 tools

Implementation for HFGI: High-Fidelity GAN Inversion for Image Attribute Editing

Continuous Augmented Positional Embeddings (CAPE) implementation for PyTorch

A very simple baseline to estimate 2D & 3D SMPL-compatible keypoints from a single color image.

PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis

implementation of paper - You Only Learn One Representation: Unified Network for Multiple Tasks

Simple Pose: Rethinking and Improving a Bottom-up Approach for Multi-Person Pose Estimation

StyleGAN2 with adaptive discriminator augmentation (ADA) - Official TensorFlow implementation

You Only Look One-level Feature (YOLOF), CVPR2021, Detectron2

Official code repository for the publication "Latent Equilibrium: A unified learning theory for arbitrarily fast computation with arbitrarily slow neurons"

PyG (PyTorch Geometric) - A library built upon PyTorch to easily write and train Graph Neural Networks (GNNs)

Barbershop: GAN-based Image Compositing using Segmentation Masks (SIGGRAPH Asia 2021)

RobustVideoMatting and background composing in one model by using onnxruntime.

Unsupervised Foreground Extraction via Deep Region Competition

Deep Learning as a Cloud API Service.

Losslandscapetaxonomy - Taxonomizing local versus global structure in neural network loss landscapes

First-Order Probabilistic Programming Language

Framework to build and train RL algorithms

This project is the official implementation of our accepted ICLR 2021 paper BiPointNet: Binary Neural Network for Point Clouds.

ADSPM: Attribute-Driven Spontaneous Motion in Unpaired Image Translation