Semantic Segmentation for Aerial Imagery using Convolutional Neural Network

Last update: Sep 23, 2022

Related tags

Deep Learning ssai

Overview

ssai-cnn

Semantic Segmentation for Aerial Imagery

Extract building and road from aerial imagery

Requirements

OpenCV 2.4.10
Boost 1.57.0
Boost.NumPy
Caffe (modified caffe: https://github.com/mitmul/caffe)
- NOTE: Build the ssai branch of the above repository

Data preparation

$ bash shells/donwload.sh
$ python scripts/create_dataset.py --dataset multi
$ python scripts/create_dataset.py --dataset single
$ python scripts/create_dataset.py --dataset roads_mini
$ python scripts/create_dataset.py --dataset roads
$ python scripts/create_dataset.py --dataset buildings
$ python scripts/create_dataset.py --dataset merged

Massatusetts Building & Road dataset

mass_roads
- train: 8458173 patches
  - epoch: 66079 mini-batches (mini-batch size: 128)
- valid: 126281 patches
  - epoch: 987 mini-batches (mini-batch size: 128)
- test: 440932 patches
  - epoch: 3445 mini-batches (mini-batch size: 128)
mass_roads_mini, mass_buildings, mass_merged
- train: 1119872 patches
  - epoch: 8749 mini-batches (mini-batch size: 128)
- valid: 36100 patches
  - epoch: 282 mini-batches (mini-batch size: 128)
- test: 89968 patches
  - epoch: 703 mini-batches (mini-batch size: 128)

Create Models

$ python scripts/create_models.py --seed seeds/model_seeds.json --caffe_dir $HOME/lib/caffe/build/install

Start training

$ bash shells/train.sh models/Mnih_CNN

will create a directory named results/Mnih_CNN_{started date}.

Prediction

$ cd results/Mnih_CNN_{started date}
$ python ../../scripts/test_prediction.py --model predict.prototxt --weight snapshots/Mnih_CNN_iter_1000000.caffemodel --img_dir ../../data/mass_merged/test/sat --channel 3

Build Library for Evaluation

$ cd lib
$ mkdir build
$ cd build
$ cmake ../
$ make

Evaluation

$ cd results/Mnih_CNN_{started date}
$ python ../../scripts/test_evaluation.py --map_dir ../../data/mass_merged/test/map --result_dir prediction_1000000 --channel 3

Model averaging

$ python ../scripts/batch_evaluation.py --offset True
$ mkdir Mnih_CNN_Merged
$ cd Mnih_CNN_Merged
$ python ../../scripts/test_evaluation.py --map_dir ../../data/mass_merged/test/map --result_dir ./prediction_100000 --channel 3 --offset 0 --pad 31

Semantic Segmentation for Aerial Imagery using Convolutional Neural Network

Related tags

Overview

This repo has been deprecated because whole things are re-implemented by using Chainer and I did refactoring for many codes. So please check this newer version: https://github.com/mitmul/ssai-cnn

Semantic Segmentation for Aerial Imagery

Requirements

Data preparation

Massatusetts Building & Road dataset

Create Models

Start training

Prediction

Build Library for Evaluation

Evaluation

Model averaging

Owner

Shunta Saito

chainladder - Property and Casualty Loss Reserving in Python

Official code of paper "PGT: A Progressive Method for Training Models on Long Videos" on CVPR2021

Pytorch implementation of FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks

PyTorch implementation of MoCo: Momentum Contrast for Unsupervised Visual Representation Learning

An auto discord account and token generator. Automatically verifies the phone number. Works without proxy. Bypasses captcha.

A resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading

PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection?

NUANCED is a user-centric conversational recommendation dataset that contains 5.1k annotated dialogues and 26k high-quality user turns.

Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch

[CVPR'21 Oral] Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning

Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it

SpanNER: Named EntityRe-/Recognition as Span Prediction

Wav2Vec for speech recognition, classification, and audio classification

Deep Learning Package based on TensorFlow

Playing around with FastAPI and streamlit to create a YoloV5 object detector

For IBM Quantum Challenge Africa 2021, 9 September (07:00 UTC) - 20 September (23:00 UTC).

EMNLP 2021 Findings' paper, SCICAP: Generating Captions for Scientific Figures

3DMV jointly combines RGB color and geometric information to perform 3D semantic segmentation of RGB-D scans.

Run object detection model on the Raspberry Pi

This is the official code for the paper "Ad2Attack: Adaptive Adversarial Attack for Real-Time UAV Tracking".