YOLOv5 in DOTA with CSL_label (Oriented Object Detection, Rotation Detection, Rotated BBox)

Last update: Dec 30, 2022

Overview

YOLOv5_DOTA_OBB

YOLOv5 on the DOTA_OBB dataset with CSL_label (Oriented Object Detection).

Datasets and pretrained checkpoint

Datasets: DOTA
Pretrained checkpoints and demo files:
- train,detect_and_evaluate_demo_files.(6666)
- yolov5x.pt.(6666)
- yolov5l.pt.(6666)
- yolov5m.pt.(6666)
- yolov5s.pt.(6666)
- YOLOv5_DOTAv1.5_OBB.pt.(6666)

Function

train.py. Trains the model.
detect.py. Runs detection, visualizes the detection results, and writes them to txt files.
evaluation.py. Merges the detection results, visualizes the merged results, and evaluates the detector.

Installation (Linux recommended, Windows not recommended)

1. Python 3.8 or later with all requirements.txt dependencies installed, including torch>=1.7. To install run:

$   pip install -r requirements.txt

2. Install swig

$   cd  \.....\yolov5_DOTA_OBB\utils
$   sudo apt-get install swig

3. Build the C++ extension for Python

$   swig -c++ -python polyiou.i
$   python setup.py build_ext --inplace

More detailed explanation

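For details on the implementation and the principles behind it, see my Zhihu article:
YOLOv5_DOTAv1.5 (遥感旋转目标检测，全踩坑记录), i.e. "Rotated object detection for remote sensing: a full record of the pitfalls".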

Usage Example

1. 'Get Dataset'

Split the DOTA_OBB images and labels, then convert the labels from DOTA format to the YOLO longside format.
You can refer to hukaixuan19970627/DOTA_devkit_YOLO.
The oriented YOLO longside format is:

$  classid    x_c   y_c   longside   shortside    Θ    Θ∈[0, 180)


* longside: The longer side of the oriented rectangle.

* shortside: The other (shorter) side of the oriented rectangle.

* Θ: The angle between the long side and the x-axis, i.e. the angle the x-axis sweeps when rotated clockwise until it meets the long side.
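The format above can be illustrated with a minimal sketch that parses one label line and recovers the four corners of the oriented rectangle. This is not code from the repository: the function names `parse_longside_label` and `longside_to_corners` are hypothetical, and the sketch assumes image coordinates (x to the right, y downward), so a clockwise rotation of the x-axis by Θ gives the direction (cos Θ, sin Θ). It also assumes all values share the same units, whatever the label file uses.

```python
import math


def parse_longside_label(line):
    """Parse 'classid x_c y_c longside shortside theta' into a tuple.

    Hypothetical helper: raises ValueError if the line does not match
    the longside format described above.
    """
    parts = line.split()
    if len(parts) != 6:
        raise ValueError("expected 6 fields, got %d" % len(parts))
    classid = int(parts[0])
    x_c, y_c, longside, shortside, theta = (float(p) for p in parts[1:])
    if not 0 <= theta < 180:
        raise ValueError("theta must lie in [0, 180)")
    if longside < shortside:
        raise ValueError("longside must not be shorter than shortside")
    return classid, x_c, y_c, longside, shortside, theta


def longside_to_corners(x_c, y_c, longside, shortside, theta_deg):
    """Return the four (x, y) corners of the oriented rectangle.

    Assumes image coordinates with y pointing down, so rotating the
    x-axis clockwise (as seen on screen) by theta gives (cos, sin).
    """
    t = math.radians(theta_deg)
    ux, uy = math.cos(t), math.sin(t)   # unit vector along the long side
    vx, vy = -uy, ux                    # unit vector along the short side
    half_l, half_s = longside / 2.0, shortside / 2.0
    signs = ((-1, -1), (1, -1), (1, 1), (-1, 1))
    return [
        (x_c + sl * half_l * ux + ss * half_s * vx,
         y_c + sl * half_l * uy + ss * half_s * vy)
        for sl, ss in signs
    ]


if __name__ == "__main__":
    label = parse_longside_label("3 100 50 40 20 30")
    for corner in longside_to_corners(*label[1:]):
        print(corner)
```

With Θ = 0 the long side lies along the x-axis; with Θ = 90 it lies along the y-axis.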

WARNING: IMAGES MUST BE SQUARE (HEIGHT = WIDTH)

2. 'train.py'

Usage is the same as in ultralytics/yolov5. It is best to train on the demo files first, before training on your custom dataset.
Single GPU training:

$ python train.py  --batch-size 4 --device 0

Multi GPU training: DistributedDataParallel Mode

$ python -m torch.distributed.launch --nproc_per_node 4 train.py --sync-bn --device 0,1,2,3

3. 'detect.py'

Download the demo files.
Then run the demo to visualize the detection results and write the result txt files.

$  python detect.py

4. 'evaluation.py'

Run the detect.py demo first. Then change the paths in evaluation.py to your own:

evaluation(
        detoutput=r'/....../DOTA_demo_view/detection',
        imageset=r'/....../DOTA_demo_view/row_images',
        annopath=r'/....../DOTA_demo_view/row_DOTA_labels/{:s}.txt'
)
draw_DOTA_image(
        imgsrcpath=r'/...../DOTA_demo_view/row_images',
        imglabelspath=r'/....../DOTA_demo_view/detection/result_txt/result_merged',
        dstpath=r'/....../DOTA_demo_view/detection/merged_drawed'
)
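The `{:s}` in `annopath` is a Python format placeholder. A minimal sketch of how such a template expands, assuming it is filled with the image name without its extension (an assumption about evaluation.py, not confirmed here). The helper names, the relative template, and the file name `P0001.png` are hypothetical.

```python
import os


def image_id_from_filename(filename):
    """Strip directory and extension, e.g. 'a/b/P0001.png' -> 'P0001'."""
    return os.path.splitext(os.path.basename(filename))[0]


def label_path_for(annopath_template, image_id):
    """Fill the '{:s}' placeholder of the template with the image id."""
    return annopath_template.format(image_id)


if __name__ == "__main__":
    # Hypothetical template and image name, for illustration only.
    template = "row_DOTA_labels/{:s}.txt"
    image_id = image_id_from_filename("row_images/P0001.png")
    print(label_path_for(template, image_id))
```

Keeping the placeholder in the path lets one template address the label file of every image in the set.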

Run the evaluation.py demo to get the evaluation result and visualize the merged detection results.

$  python evaluation.py

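Feedback

If you run into any problems while using this project, feel free to let me know. You can reach me through the following channels:

Zhihu (@略略略)
For code problems, please open an issue; for other questions, please contact me on Zhihu.

Acknowledgements

Thanks to the following projects, in no particular order.

About the author

  Name  : "胡凯旋"
  describe myself: "Just a salted fish (a slacker)"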
