Visualizing Yolov5's layers using GradCam

Last update: Jan 01, 2023

Overview

YOLO-V5 GRADCAM

I constantly desired to know to which part of an object the object-detection models pay more attention. So I searched for it, but I didn't find any for Yolov5. Here is my implementation of Grad-cam for YOLO-v5. To load the model I used the yolov5's main codes, and for computing GradCam I used the codes from the gradcam_plus_plus-pytorch repository. Please follow my GitHub account and star ⭐ the project if this functionality benefits your research or projects.

Installation

pip install -r requirements.txt

Infer

python main.py --model-path yolov5s.pt --img-path images/cat-dog.jpg --output-dir outputs

NOTE: If you don't have any weights and just want to test, don't change the model-path argument. The yolov5s model will be automatically downloaded thanks to the download function from yolov5.

NOTE: For more input arguments, check out the main.py or run the following command:

python main.py -h

Examples

Note

I checked the code, but I couldn't find an explanation for why the truck's heatmap does not show anything. Please inform me or create a pull request if you find the reason.

TO Do

Add GradCam++
Add ScoreCam
Add the functionality to the deep_utils library

References

Citation

Please cite yolov5-gradcam if it helps your research. You can use the following BibTeX entry:

@misc{deep_utils,
	title = {yolov5-gradcam},
	author = {Mohammadi Kazaj, Pooya},
	howpublished = {\url{github.com/pooya-mohammadi/yolov5-gradcam}},
	year = {2021}
}

Visualizing Yolov5's layers using GradCam

Related tags

Overview

YOLO-V5 GRADCAM

Installation

Infer

Examples

Note

TO Do

References

Citation

Owner

Pooya Mohammadi Kazaj

Sharpness-Aware Minimization for Efficiently Improving Generalization

Forecasting directional movements of stock prices for intraday trading using LSTM and random forest

Official implementation of "UCTransNet: Rethinking the Skip Connections in U-Net from a Channel-wise Perspective with Transformer"

NFT-Price-Prediction-CNN - Using visual feature extraction, prices of NFTs are predicted via CNN (Alexnet and Resnet) architectures.

DFM: A Performance Baseline for Deep Feature Matching

Axel - 3D printed robotic hands and they controll with Raspberry Pi and Arduino combo

Pose estimation with MoveNet Lightning

Functional TensorFlow Implementation of Singular Value Decomposition for paper Fast Graph Learning

Code for Greedy Gradient Ensemble for Visual Question Answering （ICCV 2021, Oral）

Pytorch Lightning Distributed Accelerators using Ray

Attack classification models with transferability, black-box attack; unrestricted adversarial attacks on imagenet

Development Kit for the SoccerNet Challenge

Public scripts, services, and configuration for running a smart home K3S network cluster

FIGARO: Generating Symbolic Music with Fine-Grained Artistic Control

Unofficial implementation of MUSIQ (Multi-Scale Image Quality Transformer)

Bag of Tricks for Natural Policy Gradient Reinforcement Learning

Codes for ACL-IJCNLP 2021 Paper "Zero-shot Fact Verification by Claim Generation"

Code for Two-stage Identifier: "Locate and Label: A Two-stage Identifier for Nested Named Entity Recognition"

Machine Learning Time-Series Platform

[CVPR 2022] Official PyTorch Implementation for "Reference-based Video Super-Resolution Using Multi-Camera Video Triplets"