Multispectral Object Detection with Yolov5

Last update: Jan 01, 2023

Overview

Multispectral-Object-Detection

Intro

Official Code for Cross-Modality Fusion Transformer for Multispectral Object Detection.

Multispectral Object Detection with Transformer and Yolov5

Citation

If you use this repo for your research, please cite our paper:

@article{fang2021cross,
  title={Cross-Modality Fusion Transformer for Multispectral Object Detection},
  author={Fang Qingyun and Han Dapeng and Wang Zhaokui},
  journal={arXiv preprint arXiv:2111.00273},
  year={2021}
}

Installation

Python>=3.6.0 is required with all requirements.txt installed including PyTorch>=1.7 (The same as yolov5 https://github.com/ultralytics/yolov5 ).

Clone the repo

git clone https://github.com/DocF/multispectral-object-detection

Install requirements

$ cd  multispectral-object-detection
$ pip install -r requirements.txt

Dataset

-[FLIR] download A new aligned version.

-[LLVIP] download

-[VEDAI] download

Run

Download the pretrained weights

yolov5 weights:

CFT weights:

Add the some file

create runs/train, runs/test and runs/detect three files for save the results.

Change the data cfg

some example in data/multispectral/

Train Test and Detect

train: python train.py

test: python test.py

detect: python detect_twostream.py

Results

Dataset	CFT	mAP50	mAP75	mAP
FLIR		73.0	32.0	37.4
FLIR	✔️	77.7 (Δ4.7)	34.8 (Δ2.8)	40.0 (Δ2.6)
LLVIP		95.8	71.4	62.3
LLVIP	✔️	97.5 (Δ1.7)	72.9 (Δ1.5)	63.6 (Δ1.3)
VEDAI		79.7	47.7	46.8
VEDAI	✔️	85.3 (Δ5.6)	65.9(Δ18.2)	56.0 (Δ9.2)

Multispectral Object Detection with Yolov5

Related tags

Overview

Multispectral-Object-Detection

Intro

Citation

Installation

Clone the repo

Install requirements

Dataset

Run

Download the pretrained weights

Add the some file

Change the data cfg

Train Test and Detect

Results

Owner

Richard Fang

Official code for our ICCV paper: "From Continuity to Editability: Inverting GANs with Consecutive Images"

Pose estimation with MoveNet Lightning

Implementation of FitVid video prediction model in JAX/Flax.

Self-Supervised Monocular DepthEstimation with Internal Feature Fusion(arXiv), BMVC2021

[ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization

Keyword-BERT: Keyword-Attentive Deep Semantic Matching

Zero-shot Learning by Generating Task-specific Adapters

[PyTorch] Official implementation of CVPR2021 paper "PointDSC: Robust Point Cloud Registration using Deep Spatial Consistency". https://arxiv.org/abs/2103.05465

A set of tools for creating and testing machine learning features, with a scikit-learn compatible API

Code for MarioNette: Self-Supervised Sprite Learning, in NeurIPS 2021

A tool for calculating distortion parameters in coordination complexes.

Official repository for the paper "Self-Supervised Models are Continual Learners" (CVPR 2022)

Rotary Transformer

Retinal vessel segmentation based on GT-UNet

An efficient implementation of GPNN

Data Augmentation with Variational Autoencoders

TuckER: Tensor Factorization for Knowledge Graph Completion

Code and data for the EMNLP 2021 paper "Just Say No: Analyzing the Stance of Neural Dialogue Generation in Offensive Contexts". Coming soon!

Based on the paper "Geometry-aware Instance-reweighted Adversarial Training" ICLR 2021 oral

Face Detection and Alignment using Multi-task Cascaded Convolutional Networks (MTCNN)