Source code of our work: "Benchmarking Deep Models for Salient Object Detection"

Last update: Dec 30, 2022

Related tags

Overview

SALOD

Source code of our work: "Benchmarking Deep Models for Salient Object Detection".
In this works, we propose a new benchmark for SALient Object Detection (SALOD) methods.

We re-implement 14 methods using same settings, including input size, data loader and evaluation metrics (thanks to Metrics). Hyperparameters of optimizer are different because of various network structures and objective functions. We try our best to tune the optimizer for these models to achieve the best performance one-by-one. Some other networks are debugging now, it is welcome for your contributions on these networks to obtain better performance.

Properties

A unify interface for new models. To develop a new network, you only need to 1) set configs; 2) define network; 3) define loss function. See methods/template.
We build a new dataset by collecting several prevalent datasets in SOD task.
Easy to adopt different backbones (Available backbones: ResNet-50, VGG-16, MobileNet-v2, EfficientNet-B0, GhostNet, Res2Net)
Testing all networks on your own device. By input the name of network, you can test all available methods in our benchmark. Comparisons includes FPS, GFLOPs, model size and multiple effectiveness metrics.
We implement a loss factory that you can change the loss functions using command line parameters.

Available Methods:

Methods	Publish.	Input	Weight	Optim.	LR	Epoch	Paper	Src Code
DHSNet	CVPR2016	320^2	95M	Adam	2e-5	30	openaccess	Pytorch
NLDF	CVPR2017	320^2	161M	Adam	1e-5	30	openaccess	Pytorch/TF
Amulet	ICCV2017	320^2	312M	Adam	1e-5	30	openaccess	Pytorch
SRM	ICCV2017	320^2	240M	Adam	5e-5	30	openaccess	Pytorch
PicaNet	CVPR2018	320^2	464M	SGD	1e-2	30	openaccess	Pytorch
DSS	TPAMI2019	320^2	525M	Adam	2e-5	30	IEEE/ArXiv	Pytorch
BASNet	CVPR2019	320^2	374M	Adam	1e-5	30	openaccess	Pytorch
CPD	CVPR2019	320^2	188M	Adam	1e-5	30	openaccess	Pytorch
PoolNet	CVPR2019	320^2	267M	Adam	5e-5	30	openaccess	Pytorch
EGNet	ICCV2019	320^2	437M	Adam	5e-5	30	openaccess	Pytorch
SCRN	ICCV2019	320^2	100M	SGD	1e-2	30	openaccess	Pytorch
GCPA	AAAI2020	320^2	263M	SGD	1e-2	30	aaai.org	Pytorch
ITSD	CVPR2020	320^2	101M	SGD	5e-3	30	openaccess	Pytorch
MINet	CVPR2020	320^2	635M	SGD	1e-3	30	openaccess	Pytorch
`Tuning`	-----	-----	------	------	-----	-----	-----	-----
*PAGE	CVPR2019	320^2	------	------	-----	-----	openaccess	TF
*PFA	CVPR2019	320^2	------	------	-----	-----	openaccess	Pytorch
*F3Net	AAAI2020	320^2	------	------	-----	-----	aaai.org	Pytorch
*PFPN	AAAI2020	320^2	------	------	-----	-----	aaai.org	Pytorch
*LDF	CVPR2020	320^2	------	------	-----	-----	openaccess	Pytorch

Usage

# model_name: lower-cased method name. E.g. poolnet, egnet, gcpa, dhsnet or minet.
python3 train.py model_name --gpus=0

python3 test.py model_name --gpus=0 --weight=path_to_weight 

python3 test_fps.py model_name --gpus=0

# To evaluate generated maps:
python3 eval.py --pre_path=path_to_maps

Results

We report benchmark results here.
More results please refer to Reproduction, Few-shot and Generalization.

Notice: please contact us if you get better results.

VGG16-based:

Methods	#Param.	GFLOPs	Tr. Time	FPS	max-F	ave-F	Fbw	MAE	SM	EM
DHSNet	15.4	52.5	7.5	69.8	.884	.815	.812	.049	.880	.893
Amulet	33.2	1362	12.5	35.1	.855	.790	.772	.061	.854	.876
NLDF	24.6	136	9.7	46.3	.886	.824	.828	.045	.881	.898
SRM	37.9	73.1	7.9	63.1	.857	.779	.769	.060	.859	.874
PicaNet	26.3	74.2	40.5*	8.8	.889	.819	.823	.046	.884	.899
DSS	62.2	99.4	11.3	30.3	.891	.827	.826	.046	.888	.899
BASNet	80.5	114.3	16.9	32.6	.906	.853	.869	.036	.899	.915
CPD	29.2	85.9	10.5	36.3	.886	.815	.792	.052	.885	.888
PoolNet	52.5	236.2	26.4	23.1	.902	.850	.852	.039	.898	.913
EGNet	101	178.8	19.2	16.3	.909	.853	.859	.037	.904	.914
SCRN	16.3	47.2	9.3	24.8	.896	.820	.822	.046	.891	.894
GCPA	42.8	197.1	17.5	29.3	.903	.836	.845	.041	.898	.907
ITSD	16.9	76.3	15.2*	30.6	.905	.820	.834	.045	.901	.896
MINet	47.8	162	21.8	23.4	.900	.839	.852	.039	.895	.909

ResNet50-based:

Methods	#Param.	GFLOPs	Tr. Time	FPS	max-F	ave-F	Fbw	MAE	SM	EM
DHSNet	24.2	13.8	3.9	49.2	.909	.830	.848	.039	.905	.905
Amulet	79.8	1093.8	6.3	35.1	.895	.822	.835	.042	.894	.900
NLDF	41.1	115.1	9.2	30.5	.903	.837	.855	.038	.898	.910
SRM	61.2	20.2	5.5	34.3	.882	.803	.812	.047	.885	.891
PicaNet	106.1	36.9	18.5*	14.8	.904	.823	.843	.041	.902	.902
DSS	134.3	35.3	6.6	27.3	.894	.821	.826	.045	.893	.898
BASNet	95.5	47.2	12.2	32.8	.917	.861	.884	.032	.909	.921
CPD	47.9	14.7	7.7	22.7	.906	.842	.836	.040	.904	.908
PoolNet	68.3	66.9	10.2	33.9	.912	.843	.861	.036	.907	.912
EGNet	111.7	222.8	25.7	10.2	.917	.851	.867	.036	.912	.914
SCRN	25.2	12.5	5.5	19.3	.910	.838	.845	.040	.906	.905
GCPA	67.1	54.3	6.8	37.8	.916	.841	.866	.035	.912	.912
ITSD	25.7	19.6	5.7	29.4	.913	.825	.842	.042	.907	.899
MINet	162.4	87	11.7	23.5	.913	.851	.871	.034	.906	.917

Create New Model

To create a new model, you can copy the template folder and modify it as you want.

cp -r ./methods/template ./methods/new_name

More details please refer to python files in template floder.

Loss Factory

We supply a Loss Factory for an easier way to tune the loss functions. You can set --loss and --lw parameters to use it.

Here are some examples:

loss_dict = {'b': BCE, 's': SSIM, 'i': IOU, 'd': DICE, 'e': Edge, 'c': CTLoss}

python train.py ... --loss=bd
# loss = 1 * bce_loss + 1 * dice_loss

python train.py ... --loss=bs --lw=0.3,0.7
# loss = 0.3 * bce_loss + 0.7 * ssim_loss

python train.py ... --loss=bsid --lw=0.3,0.1,0.5,0.2
# loss = 0.3 * bce_loss + 0.1 * ssim_loss + 0.5 * iou_loss + 0.2 * dice_loss

Source code of our work: "Benchmarking Deep Models for Salient Object Detection"

Related tags

Overview

SALOD

Properties

Available Methods:

Usage

Results

VGG16-based:

ResNet50-based:

Create New Model

Loss Factory

Owner

Semi-supervised Domain Adaptation via Minimax Entropy

Human pose estimation from video plays a critical role in various applications such as quantifying physical exercises, sign language recognition, and full-body gesture control.

PaddleBoBo是基于PaddlePaddle和PaddleSpeech、PaddleGAN等开发套件的虚拟主播快速生成项目

This repository contains a pytorch implementation of "HeadNeRF: A Real-time NeRF-based Parametric Head Model (CVPR 2022)".

The reference baseline of final exam for XMU machine learning course

Plaything for Autistic Children (demo for PaddlePaddle/Wechaty/Mixlab project)

TFOD-MASKRCNN - Tensorflow MaskRCNN With Python

A trusty face recognition research platform developed by Tencent Youtu Lab

Pytorch Implementation for CVPR2018 Paper: Learning to Compare: Relation Network for Few-Shot Learning

toroidal - a lightweight transformer library for PyTorch

The King is Naked: on the Notion of Robustness for Natural Language Processing

Point detection through multi-instance deep heatmap regression for sutures in endoscopy

This is an (re-)implementation of DeepLab-ResNet in TensorFlow for semantic image segmentation on the PASCAL VOC dataset.

Sequence-to-Sequence learning using PyTorch

Domain Generalization for Mammography Detection via Multi-style and Multi-view Contrastive Learning

Python binding for Khiva library.

The repository includes the code for training cell counting applications. (Keras + Tensorflow)

A GOOD REPRESENTATION DETECTS NOISY LABELS

Geometric Algebra package for JAX

Avalanche RL: an End-to-End Library for Continual Reinforcement Learning