Spatial Attentive Single-Image Deraining with a High Quality Real Rain Dataset (CVPR'19)

Last update: Dec 01, 2022

Overview

Tianyu Wang*, Xin Yang*, Ke Xu, Shaozhe Chen, Qiang Zhang, Rynson W.H. Lau † (* Joint first author. † Rynson Lau is the corresponding author.)

[Arxiv]

Abstract

Removing rain streaks from a single image has been drawing considerable attention as rain streaks can severely degrade the image quality and affect the performance of existing outdoor vision tasks. While recent CNN-based derainers have reported promising performances, deraining remains an open problem for two reasons. First, existing synthesized rain datasets have only limited realism, in terms of modeling real rain characteristics such as rain shape, direction and intensity. Second, there are no public benchmarks for quantitative comparisons on real rain images, which makes the current evaluation less objective. The core challenge is that real world rain/clean image pairs cannot be captured at the same time. In this paper, we address the single image rain removal problem in two ways. First, we propose a semi-automatic method that incorporates temporal priors and human supervision to generate a high-quality clean image from each input sequence of real rain images. Using this method, we construct a large-scale dataset of ∼29.5K rain/rain-free image pairs that cover a wide range of natural rain scenes. Second, to better cover the stochastic distributions of real rain streaks, we propose a novel SPatial Attentive Network (SPANet) to remove rain streaks in a local-to-global manner. Extensive experiments demonstrate that our network performs favorably against the state-of-the-art deraining methods.

Citation

If you use this code or our dataset (including the test set), please cite:

@InProceedings{Wang_2019_CVPR,
  author = {Wang, Tianyu and Yang, Xin and Xu, Ke and Chen, Shaozhe and Zhang, Qiang and Lau, Rynson W.H.},
  title = {Spatial Attentive Single-Image Deraining with a High Quality Real Rain Dataset},
  booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  month = {June},
  year = {2019}
}

Dataset

See my personal site

UPDATE: We have released the code for clean image generation. We also provide some synthesized and real video examples for researchers to try. Note that the current implementation uses only 8 threads.
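To give a feel for what a temporal prior can do, here is a minimal sketch. It is not the released clean image generation code, which is semi-automatic and involves human supervision. It swaps in a plain per-pixel temporal median and assumes a static camera and that rain covers any given pixel in fewer than half of the frames. The function name temporal_median is hypothetical.

```python
import numpy as np


def temporal_median(frames):
    """Estimate a rain-free image from frames of a static scene.

    Illustrative only: takes the per-pixel median over time, assuming
    (hypothetically) that rain hits each pixel in a minority of frames.
    `frames` is a sequence of equally shaped HxW or HxWxC arrays.
    """
    stack = np.stack([np.asarray(f) for f in frames], axis=0)
    # Median along the time axis, cast back to the input dtype.
    return np.median(stack, axis=0).astype(stack.dtype)


# Toy clip: constant gray background, one bright "streak" pixel per frame.
background = np.full((4, 4), 100, dtype=np.uint8)
clip = []
for t in range(5):
    frame = background.copy()
    frame[t % 4, t % 4] = 255  # rain streak at a different pixel each frame
    clip.append(frame)

clean_estimate = temporal_median(clip)
```

On this toy clip the median recovers the background, because each pixel is bright in at most two of the five frames. Real sequences with dense rain or moving objects would break that assumption.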

Requirements

PyTorch == 0.4.1 (1.0.x may not work for training)
cupy (Installation Guide)
opencv-python
tensorboardX
Python 3.6
progressbar2
scikit-image
ffmpeg >= 4.0.1
python-ffmpeg

Setup

Clone this repo:

$ git clone ...
$ cd SPANet

Train & Test

Train:

Download the dataset (~44GB) and unpack it into the code folder (see details in Train_Dataset_README.md). Then, run:

$ python main.py -a train -m latest

Test:

Download the test dataset (~455MB) and unpack it into the code folder (see details in Test_Dataset_README.md). Then, run:

$ python main.py -a test -m latest

Performance Change

PSNR 38.02 -> 38.53

SSIM 0.9868 -> 0.9875

For better generalization, we stop training at 40K steps.

All PSNR and SSIM values are computed using skimage.measure. Please use the same module when evaluating your own work so that the numbers are comparable.
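As a sanity check on an evaluation pipeline, the PSNR definition can be sketched in a few lines of NumPy. This is an illustration of the metric, not the repository's evaluation code. The helper name psnr and the 8-bit data_range default are assumptions. For reported numbers, use the scikit-image functions: compare_psnr and compare_ssim in skimage.measure in older releases, moved to skimage.metrics as peak_signal_noise_ratio and structural_similarity in newer ones.

```python
import numpy as np


def psnr(clean, derained, data_range=255.0):
    """Peak signal-to-noise ratio in dB between two equally shaped images.

    `data_range` is the maximum possible pixel value (255 for 8-bit
    images); this default is an assumption of the sketch.
    """
    clean = np.asarray(clean, dtype=np.float64)
    derained = np.asarray(derained, dtype=np.float64)
    mse = np.mean((clean - derained) ** 2)
    if mse == 0:
        return float("inf")  # identical images
    return float(10.0 * np.log10(data_range ** 2 / mse))


# Toy example: the output differs from the ground truth by 1 at every pixel.
ground_truth = np.full((8, 8, 3), 120, dtype=np.uint8)
output = np.full((8, 8, 3), 121, dtype=np.uint8)
score = psnr(ground_truth, output)
```

Converting to float64 before subtracting avoids the wrap-around that uint8 arithmetic would otherwise introduce.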

License

Please see the License.txt file.

Acknowledgement

Code borrows from RESCAN by Xia Li. The CUDA extension references pyinn by Sergey Zagoruyko and DSC(CF-Caffe) by Xiaowei Hu. Thanks for sharing!

Contact

E-Mail: [email protected]

Owner

Steve Wong
