R3Det based on mmdet 2.19.0

Last update: Dec 15, 2022

Related tags

Overview

R³Det: Refined Single-Stage Detector with Feature Refinement for Rotating Object

Installation

# install mmdetection first if you haven't installed it yet. (Refer to mmdetection for details.)
pip install mmdet==2.19.0

# install r3det (Compiling rotated ops is a little time-consuming.)
pip install -r requirements.txt
pip install -v -e .

It is best to use opencv-python greater than 4.5.1 because its angle representation has been changed in 4.5.1. The following experiments are all run with 4.5.3.

Quick Start

Please change path in configs to your data path.

# train
CUDA_VISIBLE_DEVICES=0 PORT=29500 \
./tools/dist_train.sh configs/rretinanet/rretinanet_obb_r50_fpn_1x_dota_v3.py 1

# submission
CUDA_VISIBLE_DEVICES=0 PORT=29500 \
./tools/dist_test.sh configs/rretinanet/rretinanet_obb_r50_fpn_1x_dota_v3.py \
        work_dirs/rretinanet_obb_r50_fpn_1x_dota_v3/epoch_12.pth 1 --format-only\
        --eval-options submission_dir=work_dirs/rretinanet_obb_r50_fpn_1x_dota_v3/Task1_results

For DOTA dataset, please crop the original images into 1024×1024 patches with an overlap of 200 by run

python tools/split/img_split.py --base_json \
       tools/split/split_configs/split_configs/dota1_0/ss_trainval.json

python tools/split/img_split.py --base_json \
       tools/split/split_configs/dota1_0/ss_test.json

Please change path in ss_trainval.json, ss_test.json to your path. (Forked from BboxToolkit, which is faster then DOTA_Devkit.)

Angle Representations

Three angle representations are built-in, which can freely switch in the config.

v1 (from R³Det): [-PI/2, 0)
v2 (from S²ANet): [-Pi/4, 3PI/4)
v3 (from OBBDetection): [-PI/2, PI/2)

The differences of the three angle representations are reflected in poly2obb, obb2poly, obb2xyxy, obb2hbb, hbb2obb, etc. [More], And according to the above three papers, the coders of them are different.

DeltaXYWHAOBBoxCoder
- v1：None
- v2：Constrained angle + Projection of dx and dy + Normalized with PI
- v3：Constrained angle and length&width + Projection of dx and dy
DeltaXYWHAHBBoxCoder
- v1：None
- v2：Constrained angle + Normalized with PI
- v3：Constrained angle and length&width + Normalized with 2PI

We believe that different coders are the key reason for the different baselines in different papers. The good news is that all the above coders can be freely switched in R3Det. In addition, R3Det also provide 4 NMS ops and 3 IoU_Calculators for rotation detection as follows:

nms.type
- v1：v1
- v2：v2
- v3：v3
- mmcv: mmcv
iou_calculator
- v1：RBboxOverlaps2D_v1
- v2：RBboxOverlaps2D_v2
- v3：RBboxOverlaps2D_v3

Performance

DOTA1.0 (Task1)

Model	Backbone	Lr schd	MS	RR	Angle	box AP	Official	Download
RRetinaNet HBB	R50-FPN	1x	-	-	v1	65.19	65.73	Baidu:0518/Google
RRetinaNet OBB	R50-FPN	1x	-	-	v3	68.20	69.40	Baidu:0518/Google
RRetinaNet OBB	R50-FPN	1x	-	-	v2	68.64	68.40	Baidu:0518/Google
R³Det	R50-FPN	1x	-	-	v1	70.41	70.66	Baidu:0518/Google
R³Det*	R50-FPN	1x	-	-	v1	70.86	-	Baidu:0518/Google

MS means multiple scale image split.
RR means random rotation.

Citation

@inproceedings{yang2021r3det,
    title={R3Det: Refined Single-Stage Detector with Feature Refinement for Rotating Object},
    author={Yang, Xue and Yan, Junchi and Feng, Ziming and He, Tao},
    booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
    volume={35},
    number={4},
    pages={3163--3171},
    year={2021}
}

R3Det based on mmdet 2.19.0

Related tags

Overview

R³Det: Refined Single-Stage Detector with Feature Refinement for Rotating Object

Installation

Quick Start

Angle Representations

Performance

Citation

Owner

SJTU-Thinklab-Det

Pytoydl: A toy deep learning framework built upon numpy.

Experiments for Operating Systems Lab (ETCS-352)

An Open-Source Package for Information Retrieval.

Reverse engineering Rosetta 2 in M1 Mac

A general 3D Object Detection codebase in PyTorch.

Pytorch implementation of set transformer

GoodNews Everyone! Context driven entity aware captioning for news images

A tiny, pedagogical neural network library with a pytorch-like API.

ULMFiT for Genomic Sequence Data

Python 3 module to print out long strings of text with intervals of time inbetween

This is Official implementation for "Pose-guided Feature Disentangling for Occluded Person Re-Identification Based on Transformer" in AAAI2022

[Nature Machine Intelligence' 21] "Advancing COVID-19 Diagnosis with Privacy-Preserving Collaboration in Artificial Intelligence"

CaFM-pytorch ICCV ACCEPT Introduction of dataset VSD4K

Breaking the Dilemma of Medical Image-to-image Translation

reimpliment of DFANet: Deep Feature Aggregation for Real-Time Semantic Segmentation

D2Go is a toolkit for efficient deep learning

Self-training with Weak Supervision (NAACL 2021)

Implementation for Simple Spectral Graph Convolution in ICLR 2021

Crowd-sourced Annotation of Human Motion.