An implementation of RetinaNet in PyTorch.

Last update: Jan 04, 2023

Overview

RetinaNet

An implementation of RetinaNet in PyTorch.

Installation
Training
Evaluation
Todo
Credits

Installation

Install PyTorch and torchvision.
For faster data augmentation, install pillow-simd:

pip uninstall -y pillow
pip install pillow-simd

Training

COCO 2017

First, install pycocotools:

git clone https://github.com/pdollar/coco/
cd coco/PythonAPI
make
python setup.py install
cd ../..
rm -r coco

Then download COCO 2017 into ./datasets/COCO/:

cd datasets
mkdir COCO
cd COCO

If your using wget:

wget http://images.cocodataset.org/zips/train2017.zip &&
wget http://images.cocodataset.org/zips/val2017.zip &&
wget http://images.cocodataset.org/annotations/annotations_trainval2017.zip

If your using aria2c (recommended on for higher bandwidth connections and for allowing resumption of the download. Tune the number of max concurrent downloads (-j) and max connections per server (-x) as needed:

aria2c -x 10 -j 10 http://images.cocodataset.org/zips/train2017.zip &&
aria2c -x 10 -j 10 http://images.cocodataset.org/zips/val2017.zip &&
aria2c -x 10 -j 10 http://images.cocodataset.org/annotations/annotations_trainval2017.zip

unzip *.zip
rm *.zip

Then just run:

python train_coco.py

Pascal VOC

cd datasets
mkdir VOC
cd VOC

wget http://host.robots.ox.ac.uk/pascal/VOC/voc2007/VOCtrainval_06-Nov-2007.tar &&
wget http://host.robots.ox.ac.uk/pascal/VOC/voc2012/VOCtrainval_11-May-2012.tar &&
wget http://host.robots.ox.ac.uk/pascal/VOC/voc2007/VOCtest_06-Nov-2007.tar

aria2c -x 10 -j 10 http://host.robots.ox.ac.uk/pascal/VOC/voc2007/VOCtrainval_06-Nov-2007.tar &&
aria2c -x 10 -j 10 http://host.robots.ox.ac.uk/pascal/VOC/voc2012/VOCtrainval_11-May-2012.tar &&
aria2c -x 10 -j 10 http://host.robots.ox.ac.uk/pascal/VOC/voc2007/VOCtest_06-Nov-2007.tar

tar xf *.tar
rm *.tar

Then just run:

python train_voc.py

Custom Dataset

Lots to write here. 😉

Evaluation

To evaluate an image on a trained model:

python eval.py [checkpoint_path] [image_path]

This will create an image (output.jpg) with bounding box annotations.

Todo

Finish converting the COCO dataset class to work with batches.
Train COCO 2017 for 90,000 iterations and save a reusable checkpoint.
Try training on Pascal VOC and add download instructions.
Produce bounding box outputs for a few sanity check images.
Upload trained weights to Github releases.
Train on the 🔮 magic proprietary dataset ✨ .

An implementation of RetinaNet in PyTorch.

Related tags

Overview

RetinaNet

Installation

Training

COCO 2017

Pascal VOC

Custom Dataset

Evaluation

Todo

Credits

Owner

Conner Vercellino

Stacked Generative Adversarial Networks

Isaac Gym Reinforcement Learning Environments

Official repository for Fourier model that can generate periodic signals

Unofficial Implement PU-Transformer

A toolset for creating Qualtrics-based IAT experiments

Readings for "A Unified View of Relational Deep Learning for Polypharmacy Side Effect, Combination Therapy, and Drug-Drug Interaction Prediction."

Dense Deep Unfolding Network with 3D-CNN Prior for Snapshot Compressive Imaging, ICCV2021 [PyTorch Code]

MoCap-Solver: A Neural Solver for Optical Motion Capture Data

A PyTorch-based R-YOLOv4 implementation which combines YOLOv4 model and loss function from R3Det for arbitrary oriented object detection.

A highly modular PyTorch framework with a focus on Neural Architecture Search (NAS).

Companion code for "Bayesian logistic regression for online recalibration and revision of risk prediction models with performance guarantees"

JORLDY an open-source Reinforcement Learning (RL) framework provided by KakaoEnterprise

CAMoE + Dual SoftMax Loss (DSL): Improving Video-Text Retrieval by Multi-Stream Corpus Alignment and Dual Softmax Loss

Image Super-Resolution by Neural Texture Transfer

PyTorch implementation of PP-LCNet: A Lightweight CPU Convolutional Neural Network

My 1st place solution at Kaggle Hotel-ID 2021

Predict stock movement with Machine Learning and Deep Learning algorithms

This project is a loose implementation of paper "Algorithmic Financial Trading with Deep Convolutional Neural Networks: Time Series to Image Conversion Approach"

TorchX is a library containing standard DSLs for authoring and running PyTorch related components for an E2E production ML pipeline.

The software associated with a paper accepted at EMNLP 2021 titled "Open Knowledge Graphs Canonicalization using Variational Autoencoders".