Official implementation of "Dynamic Anchor Learning for Arbitrary-Oriented Object Detection" (AAAI2021).

Last update: Nov 28, 2022

Overview

DAL

This project hosts the official implementation for our AAAI 2021 paper:

Dynamic Anchor Learning for Arbitrary-Oriented Object Detection [arxiv] [comments].

Abstract

In this paper, we propose a dynamic anchor learning (DAL) method, which utilizes the newly deﬁned matching degree to comprehensively evaluate the localization potential of the anchors and carry out a more efﬁcient label assignment process. In this way, the detector can dynamically select high-quality anchors to achieve accurate object detection, and the divergence between classiﬁcation and regression will be alleviated.

Getting Started

The codes build Rotated RetinaNet with the proposed DAL method for rotation object detection. The supported datasets include: DOTA, HRSC2016, ICDAR2013, ICDAR2015, UCAS-AOD, NWPU VHR-10, VOC.

Installation

Insatll requirements:

pip install -r requirements.txt
pip install git+git://github.com/lehduong/torch-warmup-lr.git

Build the Cython and CUDA modules:

cd $ROOT/utils
sh make.sh
cd $ROOT/utils/overlaps_cuda
python setup.py build_ext --inplace

Installation for DOTA_devkit:

cd $ROOT/datasets/DOTA_devkit
sudo apt-get install swig
swig -c++ -python polyiou.i
python setup.py build_ext --inplace

Inference

You can use the following command to test a dataset. Note that weight, img_dir, dataset,hyp should be modified as appropriate.

python demo.py

Train

Move the dataset to the $ROOT directory.
Generate imageset files for daatset division via:

cd $ROOT/datasets
python generate_imageset.py

Modify the configuration file hyp.py and arguments in train.py, then start training:

python train.py

Evaluation

Different datasets use different test methods. For UCAS-AOD/HRSC2016/VOC/NWPU VHR-10, you need to prepare labels in the appropriate format in advance. Take evaluation on HRSC2016 for example:

cd $ROOT/datasets/evaluate
python hrsc2gt.py

then you can conduct evaluation:

python eval.py

Note that :

the script needs to be executed only once, but testing on different datasets needs to be executed again.
the imageset file used in hrsc2gt.py is generated from generate_imageset.py.

Main Results

Method	Dataset	Bbox	Backbone	Input Size	mAP/F1
DAL	DOTA	OBB	ResNet-101	800 x 800	71.78
DAL	UCAS-AOD	OBB	ResNet-101	800 x 800	89.87
DAL	HRSC2016	OBB	ResNet-50	416 x 416	88.60
DAL	ICDAR2015	OBB	ResNet-101	800 x 800	82.4
DAL	ICDAR2013	HBB	ResNet-101	800 x 800	81.3
DAL	NWPU VHR-10	HBB	ResNet-101	800 x 800	88.3
DAL	VOC 2007	HBB	ResNet-101	800 x 800	76.1

Detections

Citation

If you find our work or code useful in your research, please consider citing:

@article{ming2020dynamic,
  title={Dynamic Anchor Learning for Arbitrary-Oriented Object Detection},
  author={Ming, Qi and Zhou, Zhiqiang and Miao, Lingjuan and Zhang, Hongwei and Li, Linhao},
  journal={arXiv preprint arXiv:2012.04150},
  year={2020}
}

If you have any questions, please contact me via issue or email.

Official implementation of "Dynamic Anchor Learning for Arbitrary-Oriented Object Detection" (AAAI2021).

Related tags

Overview

DAL

Abstract

Getting Started

Installation

Inference

Train

Evaluation

Main Results

Detections

Citation

Owner

ming71

This is an official implementation for "Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation".

A Survey on Deep Learning Technique for Video Segmentation

This repository includes the official project for the paper: TransMix: Attend to Mix for Vision Transformers.

This project uses Template Matching technique for object detecting by detection of template image over base image.

ICRA 2021 "Towards Precise and Efficient Image Guided Depth Completion"

Breast cancer is been classified into benign tumour and malignant tumour.

AI-based, context-driven network device ranking

A texturizer that I just made. Nothing special here.

Unofficial implementation of MLP-Mixer: An all-MLP Architecture for Vision

Anomaly detection in multi-agent trajectories: Code for training, evaluation and the OpenAI highway simulation.

⚡ H2G-Net for Semantic Segmentation of Histopathological Images

A PyTorch-based R-YOLOv4 implementation which combines YOLOv4 model and loss function from R3Det for arbitrary oriented object detection.

TensorFlow implementation of Barlow Twins (Barlow Twins: Self-Supervised Learning via Redundancy Reduction)

Official implementation for "Low-light Image Enhancement via Breaking Down the Darkness"

GT China coal model

The repository for the paper "When Do You Need Billions of Words of Pretraining Data?"

Distributed DataLoader For Pytorch Based On Ray

Public repository containing materials used for Feed Forward (FF) Neural Networks article.

Research code for CVPR 2021 paper "End-to-End Human Pose and Mesh Reconstruction with Transformers"

Several simple examples for popular neural network toolkits calling custom CUDA operators.