Official PyTorch implementation of Active Learning for Deep Object Detection via Probabilistic Modeling (ICCV 2021)

Last update: Jan 06, 2023

Overview

Active Learning for Deep Object Detection via Probabilistic Modeling

This repository is the official PyTorch implementation of Active Learning for Deep Object Detection via Probabilistic Modeling, ICCV 2021.

The proposed method is implemented on top of the SSD pytorch codebase.

Our approach relies on mixture density networks to estimate, in a single forward pass of a single model, both localization and classification uncertainties, and leverages them in the scoring function for active learning.
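
To make the idea of reading uncertainties off a mixture model concrete, here is a minimal sketch, not the repository's code. It assumes a one-dimensional Gaussian mixture (for example, one bounding-box coordinate) and splits its variance with the law of total variance: the weighted average of component variances is treated as aleatoric uncertainty, and the weighted spread of component means as epistemic uncertainty. The function name gmm_uncertainties and its arguments are hypothetical.

```python
from typing import List, Tuple


def gmm_uncertainties(
    weights: List[float], means: List[float], variances: List[float]
) -> Tuple[float, float]:
    """Return (aleatoric, epistemic) uncertainty of a 1-D Gaussian mixture.

    Hypothetical helper for illustration only; the repository may compute
    these quantities differently.
    """
    total = sum(weights)
    pi = [w / total for w in weights]  # normalise mixture weights to sum to 1
    mixture_mean = sum(p * m for p, m in zip(pi, means))
    # Aleatoric part: weighted average of the component variances.
    aleatoric = sum(p * v for p, v in zip(pi, variances))
    # Epistemic part: weighted spread of component means around the mixture mean.
    epistemic = sum(p * (m - mixture_mean) ** 2 for p, m in zip(pi, means))
    return aleatoric, epistemic


if __name__ == "__main__":
    # Components that agree on the mean give zero epistemic uncertainty.
    print(gmm_uncertainties([0.5, 0.5], [1.0, 1.0], [0.2, 0.4]))
    # Components that disagree on the mean give a large epistemic term.
    print(gmm_uncertainties([0.5, 0.5], [0.0, 2.0], [0.1, 0.1]))
```

Because all mixture parameters come from one network output, both terms are available after a single forward pass, with no need for repeated sampling or multiple models.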

Our method performs on par with methods that rely on multiple models (e.g., ensembles and MC-Dropout) while needing only a single forward pass of a single model. It therefore offers a favorable trade-off between accuracy and computational cost.
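
The scoring function mentioned above turns per-detection uncertainties into one score per image. The sketch below shows one plausible way to do that and is an assumption, not a transcription of the repository: each uncertainty type is reduced to an image-level value by taking the maximum over detections, the values are z-score normalised across the unlabeled pool so the types are comparable, and the image score is the maximum over types. All names (zscore, score_images, select_for_labeling) and the input layout are hypothetical.

```python
from statistics import mean, pstdev
from typing import Dict, List


def zscore(values: List[float]) -> List[float]:
    """Standardise values to zero mean and unit (population) deviation."""
    if not values:
        return []
    mu, sd = mean(values), pstdev(values)
    if sd == 0:
        return [0.0 for _ in values]  # all values equal: nothing stands out
    return [(v - mu) / sd for v in values]


def score_images(per_image: Dict[str, Dict[str, List[float]]]) -> Dict[str, float]:
    """Map image id -> informativeness score.

    per_image[image_id][kind] lists the per-detection uncertainties of one
    kind (e.g. "loc" or "cls"); this layout is an assumption.
    """
    ids = list(per_image)
    kinds = sorted({k for d in per_image.values() for k in d})
    scores = {i: float("-inf") for i in ids}
    for kind in kinds:
        # Image-level value: the most uncertain detection (0.0 if none).
        image_level = [max(per_image[i].get(kind, []), default=0.0) for i in ids]
        for i, z in zip(ids, zscore(image_level)):
            scores[i] = max(scores[i], z)  # keep the most alarming kind
    return scores


def select_for_labeling(
    per_image: Dict[str, Dict[str, List[float]]], budget: int
) -> List[str]:
    """Return the ids of the `budget` highest-scoring images."""
    scores = score_images(per_image)
    return sorted(scores, key=scores.get, reverse=True)[:budget]


if __name__ == "__main__":
    pool = {
        "a": {"loc": [0.1, 0.2], "cls": [0.5]},
        "b": {"loc": [0.9], "cls": [0.4, 0.1]},
        "c": {"loc": [0.3], "cls": [0.45]},
    }
    print(select_for_labeling(pool, budget=2))
```

Normalising each uncertainty type before combining them keeps one type with a larger numeric range from dominating the selection.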

License

To view the NVIDIA Source Code License for this work, visit https://github.com/NVlabs/AL-MDN/blob/main/LICENSE

Requirements

For setup and data preparation, please refer to the README in SSD pytorch.

The code was tested in a virtual environment with Python 3+ and PyTorch 1.1.

Training

Create a weights directory and enter it (mkdir weights, then cd weights).
Download the FC-reduced VGG-16 backbone weights into the weights directory, then return to the parent directory (cd ..).
If necessary, change VOC_ROOT in data/voc0712.py or COCO_ROOT in data/coco.py to match your dataset location.
See data/config.py for the configuration.
Run the training code:

# Supervised learning
CUDA_VISIBLE_DEVICES=<GPU_ID> python train_ssd_gmm_supervised_learning.py

# Active learning
CUDA_VISIBLE_DEVICES=<GPU_ID> python train_ssd_gmm_active_learining.py
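
If you prefer to launch training from Python rather than the shell, the two commands above can be wrapped as in the sketch below. It sets CUDA_VISIBLE_DEVICES for the child process only and uses the script names exactly as they appear above. The helper names build_training_command and run_training are hypothetical and are not part of the repository.

```python
import os
import subprocess
import sys
from typing import Dict, List, Tuple

# Script names copied verbatim from the commands above.
SCRIPTS = {
    "supervised": "train_ssd_gmm_supervised_learning.py",
    "active": "train_ssd_gmm_active_learining.py",
}


def build_training_command(mode: str, gpu_id: str) -> Tuple[List[str], Dict[str, str]]:
    """Return (argv, environment) for the chosen training mode."""
    if mode not in SCRIPTS:
        raise ValueError(f"mode must be one of {sorted(SCRIPTS)}, got {mode!r}")
    env = dict(os.environ)  # copy so the parent environment is untouched
    env["CUDA_VISIBLE_DEVICES"] = gpu_id  # restrict the GPUs the child sees
    return [sys.executable, SCRIPTS[mode]], env


def run_training(mode: str, gpu_id: str) -> int:
    """Run the training script from the repository root; raise on failure."""
    cmd, env = build_training_command(mode, gpu_id)
    return subprocess.run(cmd, env=env, check=True).returncode
```

For example, run_training("active", "0") would be equivalent to the active learning command above with GPU 0 selected, assuming it is called from the repository root.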

Evaluation

To evaluate on MS-COCO, set COCO_ROOT_EVAL in data/coco_eval.py to match your dataset location.
Run the evaluation code:

# Evaluation on PASCAL VOC
python eval_voc.py --trained_model <trained weight path>

# Evaluation on MS-COCO
python eval_coco.py --trained_model <trained weight path>

Visualization

Run the visualization code:

python demo.py --trained_model <trained weight path>

Citation

@InProceedings{Choi_2021_ICCV,
    author    = {Choi, Jiwoong and Elezi, Ismail and Lee, Hyuk-Jae and Farabet, Clement and Alvarez, Jose M.},
    title     = {Active Learning for Deep Object Detection via Probabilistic Modeling},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2021},
    pages     = {10264-10273}
}


Owner

NVIDIA Research Projects
