[CVPR'20] TTSR: Learning Texture Transformer Network for Image Super-Resolution

Last update: Dec 28, 2022

Related tags

Overview

TTSR

Official PyTorch implementation of the paper Learning Texture Transformer Network for Image Super-Resolution accepted in CVPR 2020.

Introduction
Requirements and dependencies
Model
Quick test
Dataset prepare
Evaluation
Train
Citation
Contact

Introduction

We proposed an approach named TTSR for RefSR task. Compared to SISR, RefSR has an extra high-resolution reference image whose textures can be utilized to help super-resolve low-resolution input.

Contribution

We are one of the first to introduce the transformer architecture into image generation tasks. More specifically, we propose a texture transformer with four closely-related modules for image SR which achieves significant improvements over SOTA approaches.
We propose a novel cross-scale feature integration module for image generation tasks which enables our approach to learn a more powerful feature representation by stacking multiple texture transformers.

Approach overview

Main results

Requirements and dependencies

python 3.7 (recommend to use Anaconda)
python packages: pip install opencv-python imageio
pytorch >= 1.1.0
torchvision >= 0.4.0

Model

Pre-trained models can be downloaded from onedrive, baidu cloud(0u6i), google drive.

TTSR-rec.pt: trained with only reconstruction loss
TTSR.pt: trained with all losses

Quick test

Clone this github repo

git clone https://github.com/FuzhiYang/TTSR.git
cd TTSR

Download pre-trained models and modify "model_path" in test.sh
Run test

sh test.sh

The results are in "save_dir" (default: ./test/demo/output)

Dataset prepare

Download CUFED train set and CUFED test set
Make dataset structure be:

CUFED
- train
  - input
  - ref
- test
  - CUFED5

Evaluation

Prepare CUFED dataset and modify "dataset_dir" in eval.sh
Download pre-trained models and modify "model_path" in eval.sh
Run evaluation

sh eval.sh

The results are in "save_dir" (default: ./eval/CUFED/TTSR)

Train

Prepare CUFED dataset and modify "dataset_dir" in train.sh
Run training

sh train.sh

The training results are in "save_dir" (default: ./train/CUFED/TTSR)

Citation

@InProceedings{yang2020learning,
author = {Yang, Fuzhi and Yang, Huan and Fu, Jianlong and Lu, Hongtao and Guo, Baining},
title = {Learning Texture Transformer Network for Image Super-Resolution},
booktitle = {CVPR},
year = {2020},
month = {June}
}

Contact

If you meet any problems, please describe them in issues or contact:

Fuzhi Yang: [email protected]

[CVPR'20] TTSR: Learning Texture Transformer Network for Image Super-Resolution

Related tags

Overview

TTSR

Contents

Introduction

Contribution

Approach overview

Main results

Requirements and dependencies

Model

Quick test

Dataset prepare

Evaluation

Train

Citation

Contact

Owner

Multimedia Research

Data manipulation and transformation for audio signal processing, powered by PyTorch

CondNet: Conditional Classifier for Scene Segmentation

Information Gain Filtration (IGF) is a method for filtering domain-specific data during language model finetuning. IGF shows significant improvements over baseline fine-tuning without data filtration.

This project is the official implementation of our accepted ICLR 2021 paper BiPointNet: Binary Neural Network for Point Clouds.

Project ArXiv Citation Network

Official implementation of the paper: "LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech"

disentanglement_lib is an open-source library for research on learning disentangled representations.

PyTorch code of my ICDAR 2021 paper Vision Transformer for Fast and Efficient Scene Text Recognition (ViTSTR)

Doosan robotic arm, simulation, control, visualization in Gazebo and ROS2 for Reinforcement Learning.

Heart Arrhythmia Classification

Code for ICCV 2021 paper: ARAPReg: An As-Rigid-As Possible Regularization Loss for Learning Deformable Shape Generators..

A set of tests for evaluating large-scale algorithms for Wasserstein-2 transport maps computation.

MGFN: Multi-Graph Fusion Networks for Urban Region Embedding was accepted by IJCAI-2022.

A library for efficient similarity search and clustering of dense vectors.

Chinese license plate recognition

(CVPR2021) ClassSR: A General Framework to Accelerate Super-Resolution Networks by Data Characteristic

Causal Imitative Model for Autonomous Driving

DECAF: Deep Extreme Classification with Label Features

Machine Learning in Asset Management (by @firmai)

My 1st place solution at Kaggle Hotel-ID 2021