Multi-Glimpse Network With Python

Last update: May 10, 2022

Related tags

Deep Learning MGNet

Overview

Multi-Glimpse Network

Our code requires Python ≥ 3.8

Installation

For example, venv + pip:

$ python3 -m venv env
$ source env/bin/activate
(env) $ python3 -m pip install -r requirements.txt

Evaluation

Accuracy on clean images

Create ImageNet100 from ImageNet (using symbolic links).

$ python3 tools/create_imagenet100.py tools/imagenet100.txt \
    /path/to/ImageNet /path/to/ImageNet100

Download checkpoints from Google Drive.
Test accuracy.

$ export dataset="--train_dir /path/to/ImageNet100/train \
    --val_dir /path/to/ImageNet100/val \
    --dataset imagenet --num_class 100"
# Baseline
$ python3 main.py $dataset --test --n_iter 1 --scale 1.0  --model resnet18 \
    --checkpoint resnet18_baseline
# Ours
$ python3 main.py $dataset --test --n_iter 4 --scale 2.33 --model resnet18 \
    --checkpoint resnet18_ours --alpha 0.6 --s 0.02

Add the flag --flop_count to count the approximate FLOPs for the inference of an image. (using fvcore)

Accuracy on adversarial attacks (PGD)

Test adversarial accuracy.

# Baseline
$ python3 main.py $dataset --test --n_iter 1 --scale 1.0  --adv --step_k 10 \
    --model resnet18 --checkpoint resnet18_baseline
# Ours
$ python3 main.py $dataset --test --n_iter 4 --scale 2.33 --adv --step_k 10 \
    --model resnet18 --checkpoint resnet18_ours --alpha 0.6 --s 0.02

Accuracy on common corruptions

Create ImageNet100-C from ImageNet-C (using symbolic links).

$ python3 tools/create_imagenet100c.py  \
    tools/imagenet100.txt  /path/to/ImageNet-C/ /path/to/ImageNet100-C/

Test for a single corruption.

$ export dataset="--train_dir /path/to/ImageNet100/train \
    --val_dir /path/to/ImageNet100-C/pixelate/5 \
    --dataset imagenet --num_class 100"
# Baseline
$ python3 main.py $dataset --test --n_iter 1 --scale 1.0  --model resnet18 \
    --checkpoint resnet18_baseline
# Ours
$ python3 main.py $dataset --test --n_iter 4 --scale 2.33 --model resnet18 \
    --checkpoint resnet18_ours --alpha 0.6 --s 0.02

A simple script to test all corruptions and collect results.

# Modify tools/eval_imagenet100c.py and run it to generate script
$ python3 tools/eval_imagenet100c.py /home2/ImageNet100-C/ > run.sh
# Evaluate
$ bash run.sh
# Collect results
$ python3 tools/collect_imagenet100c.py

Training

$ export dataset="--train_dir /path/to/ImageNet100/train \
    --val_dir /path/to/ImageNet100/val \
    --dataset imagenet --num_class 100"
# Baseline
$ python3 main.py $dataset --epochs 400 --n_iter 1 --scale 1.0 \
    --model resnet18 --gpu 0,1,2,3
# Ours
$ python3 main.py $dataset --epochs 400 --n_iter 4 --scale 2.33 \
    --model resnet18 --alpha 0.6 --s 0.02  --gpu 0,1,2,3

Check tensorboard for the logs. (When training with multiple gpus, the log value may be scaled by the number of gpus except for the validation accuracy)

tensorboard  --logdir=logs

Note that we left our exploration in the code for further study, e.g., self-supervised spatial guidance, dynamic gradient re-scaling operation.

Multi-Glimpse Network With Python

Related tags

Overview

Multi-Glimpse Network

Installation

Evaluation

Accuracy on clean images

Accuracy on adversarial attacks (PGD)

Accuracy on common corruptions

Training

Owner

Machine Unlearning with SISA

YOLOv4 / Scaled-YOLOv4 / YOLO - Neural Networks for Object Detection (Windows and Linux version of Darknet )

[CVPR2022] Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos

For holding anime-related object classification and detection models

Flexible-Modal Face Anti-Spoofing: A Benchmark

Acoustic mosquito detection code with Bayesian Neural Networks

All the essential resources and template code needed to understand and practice data structures and algorithms in python with few small projects to demonstrate their practical application.

GuideDog is an AI/ML-based mobile app designed to assist the lives of the visually impaired, 100% voice-controlled

[CVPRW 2021] Code for Region-Adaptive Deformable Network for Image Quality Assessment

Implementation of PersonaGPT Dialog Model

A 3D Dense mapping backend library of SLAM based on taichi-Lang designed for the aerial swarm.

OpenMMLab 3D Human Parametric Model Toolbox and Benchmark

PyTorch Personal Trainer: My framework for deep learning experiments

QAHOI: Query-Based Anchors for Human-Object Interaction Detection (paper)

PyTorch implementation of Barlow Twins.

Computer Vision and Pattern Recognition, NUS CS4243, 2022

SymPy-powered, Wolfram|Alpha-like answer engine totally in your browser, without backend computation

CVPR 2021 Challenge on Super-Resolution Space

Scripts used to make and evaluate OpenAlex's concept tagging model

An implementation of Deep Graph Infomax (DGI) in PyTorch