Implementation for "Exploiting Aliasing for Manga Restoration" (CVPR 2021)

Last update: Dec 15, 2022

Overview

[CVPR Paper](To appear) | [Project Website](To appear) | BibTex

Introduction

As a popular entertainment art form, manga enriches line drawing details with bitonal screentones. However, manga resources on the Internet usually show screentone artifacts because of inappropriate scanning or rescaling resolutions. In this paper, we propose an innovative two-stage method to restore quality bitonal manga from degraded ones. Our key observation is that the aliasing induced by downsampling bitonal screentones can be utilized as an informative clue to infer the original resolution and screentones. First, we predict the target resolution from the degraded manga via the Scale Estimation Network (SE-Net) with a spatial voting scheme. Then, at the target resolution, we restore the region-wise bitonal screentones via the Manga Restoration Network (MR-Net) discriminatively, depending on the degree of degradation. Specifically, the original screentones are directly restored in pattern-identifiable regions, and visually plausible screentones are synthesized in pattern-agnostic regions. Quantitative evaluation on synthetic data and visual assessment on real-world cases illustrate the effectiveness of our method.
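
The key observation about aliasing can be illustrated with a toy 1-D example. The sketch below is not the SE-Net or any part of this repository; the helper names (stripe_pattern, subsample, dominant_period) and the chosen period and factors are assumptions made purely for illustration. It subsamples a bitonal stripe pattern without any anti-aliasing filter and measures the apparent period with an FFT, showing that each downsampling factor leaves a different signature.

```python
import numpy as np


def stripe_pattern(length, period):
    """Toy bitonal 1-D 'screentone': 1 on the first half of each period, 0 on the rest."""
    x = np.arange(length)
    return ((x % period) < period / 2).astype(np.float64)


def subsample(signal, factor):
    """Naive downsampling: keep every `factor`-th sample, no low-pass filtering."""
    return signal[::factor]


def dominant_period(signal):
    """Period, in samples, of the strongest non-DC frequency component."""
    spectrum = np.abs(np.fft.rfft(signal - signal.mean()))
    k = int(np.argmax(spectrum[1:])) + 1  # skip the DC bin
    return len(signal) / k


if __name__ == "__main__":
    pattern = stripe_pattern(240, period=6)
    print("original period (px):", dominant_period(pattern))
    for factor in (4, 5):
        low = subsample(pattern, factor)
        apparent_px = dominant_period(low) * factor
        print(f"factor {factor}: apparent period {apparent_px} px")
```

With a true period of 6 pixels, subsampling by 4 yields an apparent period of 12 pixels and subsampling by 5 yields 30 pixels. The aliased pattern therefore depends on the scale factor, which is the kind of clue the abstract refers to.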

Example Results

Below is an example of a manga image restored by our method. The image comes from the Manga109 dataset.

Pretrained models

Download the model below and put it under release_model/.

MangaRestoration

Run

Requirements:
- Install Python 3.6.
- Install PyTorch (tested on release 1.1.0).

Testing:
- Place your test images under datazip/manga1/test.
- Prepare the image file list using flist.py.
- Modify manga.json to set the path to the data.
- Run: python testreal.py -c [config_file] -n [model_name] -s [image_size]
- For example: python testreal.py -c configs/manga.json -n resattencv -s 256
- To specify the scale factor yourself, run: python testreal.py -c [config_file] -n [model_name] -s [image_size] -sl [scale]
- Note that the convex interpolation refinement requires a large amount of GPU memory. It is disabled by default (bilinear=True); to enable it, set bilinear=False in MangaRestorator when restoring images.
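
When testing several settings, the command line from the steps above can be assembled programmatically. The following is a minimal sketch, not part of the repository: build_test_command is a hypothetical helper, and the scale value 2 in the usage is an arbitrary illustration. Only the flags listed above (-c, -n, -s, -sl) are used.

```python
import subprocess


def build_test_command(config_file, model_name, image_size, scale=None):
    """Assemble the testreal.py invocation described in the testing steps.

    Hypothetical helper for illustration; it only uses the flags
    documented in this README.
    """
    cmd = [
        "python", "testreal.py",
        "-c", str(config_file),
        "-n", str(model_name),
        "-s", str(image_size),
    ]
    if scale is not None:
        # Optional: pass an explicit scale factor via -sl.
        cmd += ["-sl", str(scale)]
    return cmd


if __name__ == "__main__":
    # Same settings as the README example.
    default_cmd = build_test_command("configs/manga.json", "resattencv", 256)
    print(" ".join(default_cmd))

    # Same settings with an explicit scale factor (2 is an assumed value).
    scaled_cmd = build_test_command("configs/manga.json", "resattencv", 256, scale=2)
    print(" ".join(scaled_cmd))

    # Uncomment to actually launch the test from the repository root:
    # subprocess.run(default_cmd, check=True)
```

Passing the arguments as a list to subprocess.run avoids shell quoting issues if the config path contains spaces.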

Citation

If any part of our paper or code is helpful to your work, please cite:

@inproceedings{xie2021exploiting,
  author = {Minshan Xie and Menghan Xia and Tien-Tsin Wong},
  title = {Exploiting Aliasing for Manga Restoration},
  booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  year = {2021}
}
