Detail-Preserving Transformer for Light Field Image Super-Resolution

Last update: Jan 01, 2023

Related tags

Overview

DPT

Official Pytorch implementation of the paper "Detail-Preserving Transformer for Light Field Image Super-Resolution" accepted by AAAI 2022 .

Updates

2022.01: Our method is available at the newly-released repository BasicLFSR, an open-source and easy-to-use toolbox for LF image SR.
2022.01: The code is released.

Requirements

Python 3.7.7
Pytorch=1.5.0
torchvision=0.6.0
h5py=2.8.0
Matlab

Dataset

We use the EPFL, HCInew, HCIold, INRIA and STFgantry datasets for both training and testing. You can download the above dataset from Baidu Drive (key:912V).

Download the visual results

We share the super-resolved results generated by our DPT. Then, researchers can compare their methods to our DPT without performing inference. Results are available at Baidu Drive (key:912V).

Prepare the datasets

To generate the training data,

 Using Matlab to run `GenerateTrainingData.m`

To generate the testing data,

 Using Matlab to run `GenerateTestData.m`

We also provide the processed datasets we used in the paper. The processed datasets are avaliable at Baidu Drive (key:912V).

Train

To perform DPT training, please run

python train.py

Checkpoint will be saved to ./log/.

Test

To evaluate DPT performance, please run

python test.py

The performance of DPT on five datasets will be printed on the screen. The visual result of each scene will be saved in ./Results/. The PSNR and SSIM values of each scene will aslo be saved in ./PSNRSSIM/.

Generate visual results

To generate the visual super-resolved results,

Using Matlab to run `GenerateResultImages.m`

The '.mat' files in ./Results/ will be converted to '.png' images to ./SRimages/.

To generate the visual gradient results, please run

python generate_visual_gradient_map.py

Gradient results will be saved to ./GRAimages/.

Citation

If you find this work helpful, please consider citing the following paper:

@article{wang2022detail,
  title={Detail Preserving Transformer for Light Field Image Super-Resolution},
  author={Wang, Shunzhou and Zhou, Tianfei and Lu, Yao and Di, Huijun},
  journal={arXiv preprint arXiv:2201.00346},
  year={2022}
}

Acknowledgements

This code is heavily based on LF-DFNet. We also refer to the codes in VSR-Transformer, COLA-Net, and SPSR. We thank the authors for sharing the codes. We would like to thank Yingqian Wang for his help with LFSR. We would also like to thank Zhengyu Liang for adding our DPT to the repository BasicLFSR.

Contact

If you have any question about this work, feel free to concat with me via [email protected].

Detail-Preserving Transformer for Light Field Image Super-Resolution

Related tags

Overview

DPT

Updates

Requirements

Dataset

Download the visual results

Prepare the datasets

Train

Test

Generate visual results

Citation

Acknowledgements

Contact

Owner

Implementation of our NeurIPS 2021 paper "A Bi-Level Framework for Learning to Solve Combinatorial Optimization on Graphs".

Code for the paper "Generative design of breakwaters usign deep convolutional neural network as a surrogate model"

A robust camera and Lidar fusion based velocity estimator to undistort the pointcloud.

A curated list of Machine Learning and Deep Learning tutorials in Jupyter Notebook format ready to run in Google Colaboratory

A robotic arm that mimics hand movement through MediaPipe tracking.

This repository contains an overview of important follow-up works based on the original Vision Transformer (ViT) by Google.

This repository contains code for the paper "Decoupling Representation and Classifier for Long-Tailed Recognition", published at ICLR 2020

Low-dose Digital Mammography with Deep Learning

Code for the ECCV2020 paper "A Differentiable Recurrent Surface for Asynchronous Event-Based Data"

CCNet: Criss-Cross Attention for Semantic Segmentation (TPAMI 2020 & ICCV 2019).

A small demonstration of using WebDataset with ImageNet and PyTorch Lightning

Code for the paper "Adversarial Generator-Encoder Networks"

PyTorch Personal Trainer: My framework for deep learning experiments

Code for "Single-view robot pose and joint angle estimation via render & compare", CVPR 2021 (Oral).

Styled text-to-drawing synthesis method. Featured at the 2021 NeurIPS Workshop on Machine Learning for Creativity and Design

Official repository for "Intriguing Properties of Vision Transformers" (2021)

OpenAi's gym environment wrapper to vectorize them with Ray

Python PID Tuner - Based on a FOPDT model obtained using a Open Loop Process Reaction Curve

In this project, we develop a face recognize platform based on MTCNN object-detection netcwork and FaceNet self-supervised network.

Tom-the-AI - A compound artificial intelligence software for Linux systems.