iNAS: Integral NAS for Device-Aware Salient Object Detection

Last update: Dec 02, 2022

Introduction

Integral search design: jointly considers backbone and head structures, as well as design and deployment devices.

Covers mainstream handcrafted saliency head designs.

SOTA performance with large latency reduction on diverse hardware platforms.

Updates

Version 0.1.0 was released on 15/11/2021:

Support training and searching on Salient Object Detection (SOD).
Support four stages in one-shot architecture search.
Support stand-alone model inference with a JSON configuration.
Provide off-the-shelf models and experiment logs.

Please refer to changelog.md for details and release history.

Dependencies and Installation

Dependencies

Python >= 3.7 (Anaconda or Miniconda is recommended)
PyTorch >= 1.7
NVIDIA GPU + CUDA

Install from a local clone

Clone the repo

git clone https://github.com/guyuchao/iNAS.git
cd iNAS

Install dependent packages

conda create -n iNAS python=3.8
conda activate iNAS
conda install -c pytorch pytorch=1.7 torchvision cudatoolkit=10.2
pip install -r requirements.txt

Install iNAS

Run the following command in the iNAS root directory to install iNAS in development mode:
```
python setup.py develop
```
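
As a quick sanity check after installation, the dependency list above can be verified with a short script. This is a minimal sketch and not part of iNAS: the names `parse_version` and `check_environment` are hypothetical helpers, and the only PyTorch attributes used are `torch.__version__` and `torch.cuda.is_available()`.

```python
import re
import sys


def parse_version(text):
    """Convert a version string such as '1.7.1+cu102' into a tuple like (1, 7, 1)."""
    numbers = []
    # Drop any local build suffix after '+', then read the leading digits of each part.
    for piece in text.split("+")[0].split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break
        numbers.append(int(match.group()))
    return tuple(numbers)


def check_environment():
    """Report whether the listed requirements (Python, PyTorch, CUDA) appear to be met."""
    report = {"python_ok": sys.version_info >= (3, 7)}
    try:
        import torch
    except ImportError:
        # PyTorch is not installed, so neither the version nor CUDA can be checked.
        report["torch_ok"] = False
        report["cuda_ok"] = False
        return report
    report["torch_ok"] = parse_version(torch.__version__) >= (1, 7)
    report["cuda_ok"] = torch.cuda.is_available()
    return report


if __name__ == "__main__":
    for name, ok in check_environment().items():
        print(f"{name}: {'OK' if ok else 'NOT MET'}")
```

If any entry reports `NOT MET`, revisit the corresponding installation step before training or searching.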

Dataset Preparation

Folder Structure

iNAS
├── iNAS
├── experiment
├── scripts
├── options
├── datasets
│   ├── saliency
│   │   ├── DUTS-TR/            # Contains both images (.jpg) and labels (.png).
│   │   ├── DUTS-TR.lst         # Lists the image-label pairs for training or testing.
│   │   ├── ECSSD/
│   │   ├── ECSSD.lst
│   │   ├── ...
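
The exact layout of the `.lst` files is not described here. As an illustration only, the sketch below assumes each non-empty line holds an image path and a label path separated by whitespace, both relative to `datasets/saliency`. The helper `read_pair_list` and the file names `a.jpg` and `b.jpg` are hypothetical; check the provided `.lst` files for the real format.

```python
import tempfile
from pathlib import Path


def read_pair_list(lst_path, root):
    """Read (image, label) path pairs from a list file.

    Assumed format (not confirmed by the iNAS docs): one pair per line,
    'relative/image.jpg relative/label.png', separated by whitespace.
    """
    root = Path(root)
    pairs = []
    for line_no, line in enumerate(Path(lst_path).read_text().splitlines(), start=1):
        fields = line.split()
        if not fields:
            continue  # skip blank lines
        if len(fields) != 2:
            raise ValueError(f"{lst_path}:{line_no}: expected 2 fields, got {len(fields)}")
        pairs.append((root / fields[0], root / fields[1]))
    return pairs


# Demo on a throwaway list file with made-up entries.
with tempfile.TemporaryDirectory() as tmp:
    lst_file = Path(tmp) / "DUTS-TR.lst"
    lst_file.write_text("DUTS-TR/a.jpg DUTS-TR/a.png\n\nDUTS-TR/b.jpg DUTS-TR/b.png\n")
    pairs = read_pair_list(lst_file, root="datasets/saliency")

for image_path, label_path in pairs:
    print(image_path, "->", label_path)
```

Note that this sketch would not handle paths containing spaces, since it splits each line on whitespace.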

Common Image SOD Datasets

We provide a list of common salient object detection datasets.

| Name | Dataset | Short Description | Download |
| --- | --- | --- | --- |
| SOD Training | DUTS-TR | 10553 images for SOD training | Google Drive / Baidu Drive (password: w69q) |
| SOD Testing | ECSSD | 1000 images for SOD testing | |
| | DUT-OMRON | 5168 images for SOD testing | |
| | DUTS-TE | 5019 images for SOD testing | |
| | HKU-IS | 4447 images for SOD testing | |
| | PASCAL-S | 850 images for SOD testing | |

How to Use

iNAS integrates the four main steps of one-shot neural architecture search:

Train supernet: Provide a fast performance evaluator for searching.
Search models: Find a Pareto frontier based on the performance evaluator and the resource evaluator.
Convert weight/Retrain/Finetune: Bring the searched model to its best performance. (Converting supernet weights to stand-alone models without retraining is now supported.)
Deploy: Test stand-alone models.

Please see Tutorial.md for the basic usage of those steps in iNAS.
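
To make the "Search models" step more concrete, the sketch below shows what a Pareto frontier over two objectives looks like: a candidate stays on the frontier only if no other candidate is both at least as fast and at least as accurate. It illustrates the concept only and is not the search procedure used by iNAS. The `Candidate` type, the `pareto_frontier` function, and all numbers are hypothetical.

```python
from typing import List, NamedTuple


class Candidate(NamedTuple):
    name: str
    latency_ms: float  # from a resource evaluator; lower is better
    score: float       # from a performance evaluator; higher is better


def pareto_frontier(candidates: List[Candidate]) -> List[Candidate]:
    """Return the non-dominated candidates, ordered by increasing latency."""
    # Sort by latency; on ties, put the higher score first.
    ordered = sorted(candidates, key=lambda c: (c.latency_ms, -c.score))
    frontier = []
    best_score = float("-inf")
    for cand in ordered:
        # Every candidate seen so far is at least as fast, so this one is
        # non-dominated only if it scores strictly higher than all of them.
        if cand.score > best_score:
            frontier.append(cand)
            best_score = cand.score
    return frontier


# Made-up candidates for illustration.
models = [
    Candidate("A", 10.0, 0.80),
    Candidate("B", 12.0, 0.78),  # slower and worse than A
    Candidate("C", 15.0, 0.85),
    Candidate("D", 20.0, 0.84),  # slower and worse than C
    Candidate("E", 25.0, 0.90),
]
frontier = pareto_frontier(models)
print([c.name for c in frontier])  # ['A', 'C', 'E']
```

Each model on the frontier represents a different latency/accuracy trade-off, so the final choice depends on the latency budget of the target device.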

Model Zoo

Pre-trained models and log examples are available in ModelZoo.md.

TODO List

Support multi-processing search (simply using data parallelism does not increase search speed).
Complete the documentation.
Add some applications.

Citation

If you find this project useful in your research, please consider citing:

@inproceedings{gu2021inas,
  title={iNAS: Integral NAS for Device-Aware Salient Object Detection},
  author={Gu, Yu-Chao and Gao, Shang-Hua and Cao, Xu-Sheng and Du, Peng and Lu, Shao-Ping and Cheng, Ming-Ming},
  booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision},
  pages={4934--4944},
  year={2021}
}

License

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (CC BY-NC-SA 4.0), which permits non-commercial use only. For commercial use, please contact us.

Acknowledgement

The project structure is borrowed from BasicSR, and parts of the implementation and evaluation code are borrowed from Once-For-All, BASNet, and BiSeNet. Thanks to the authors of these excellent projects.

Contact

If you have any questions, please email [email protected].


Owner

顾宇超
