CR-Fill: Generative Image Inpainting with Auxiliary Contextual Reconstruction. ICCV 2021

Last update: Dec 20, 2022

Overview

crfill

code for paper ``CR-Fill: Generative Image Inpainting with Auxiliary Contextual Reconstruction". This repo (including code and models) are for research purposes only.

Usage

Dependencies

Download code

git clone --single-branch https://github.com/zengxianyu/crfill
git submodule init
git submodule update

Download data and model

chmod +x download/*
./download/download_model.sh
./download/download_datal.sh

Install dependencies:

conda env create -f environment.yml

or install these packages manually in a Python 3.6 enviroment:

pytorch=1.3.1, opencv=3.4.2, tqdm, torchvision, dill, matplotlib, opencv

Inference

./test.sh

These script will run the inpainting model on the samples I provided. Modify the options --image_dir, --mask_dir, --output_dir in test.sh to test on custom data.

Train

Prepare training datasets and put them in ./datasets/ following the example ./datasets/places
run the training script:

./train.sh

open the html files in ./output to visualize training

After the training is finished, the model files can be found in ./checkpoints/debugarr0

you may modify the training script to use different settings, e.g., batch size, hyperparameters

Finetune

For finetune on custom dataset based on my pretrained models, use the following command:

download checkpoints

./download/download_pretrain.sh

run the training script

./finetune.sh

you may change the options in finetune.sh to use different hyperparameters or your own dataset

Web APP

To use the web app, these additional packages are required:

flask, requests, pillow

./demo.sh

then open http://localhost:2334 in the browser to use the web app

Citing

@inproceedings{zeng2021generative,
  title={CR-Fill: Generative Image Inpainting with Auxiliary Contextual Reconstruction},
  author={Zeng, Yu and Lin, Zhe and Lu, Huchuan and Patel, Vishal M.},
  booktitle={Proceedings of the IEEE International Conference on Computer Vision},
  year={2021}
}

Acknowledgement

DeepFill https://github.com/jiahuiyu/generative_inpainting
Pix2PixHD https://github.com/NVIDIA/pix2pixHD
SPADE https://github.com/NVlabs/SPADE

CR-Fill: Generative Image Inpainting with Auxiliary Contextual Reconstruction. ICCV 2021

Related tags

Overview

crfill

Usage

Dependencies

Inference

Train

Finetune

Web APP

Citing

Acknowledgement

Owner

An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

Implementation of the paper: "SinGAN: Learning a Generative Model from a Single Natural Image"

Implementation of "StrengthNet: Deep Learning-based Emotion Strength Assessment for Emotional Speech Synthesis"

Morphable Detector for Object Detection on Demand

Official PyTorch implementation of the paper: Improving Graph Neural Network Expressivity via Subgraph Isomorphism Counting.

Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Machine Learning automation and tracking

Music source separation is a task to separate audio recordings into individual sources

Detecting drunk people through thermal images using Deep Learning (CNN)

Light-weight network, depth estimation, knowledge distillation, real-time depth estimation, auxiliary data.

Smart edu-autobooking - Johnson @ DMI-UNICT study room self-booking system

Non-Official Pytorch implementation of "Face Identity Disentanglement via Latent Space Mapping" https://arxiv.org/abs/2005.07728 Using StyleGAN2 instead of StyleGAN

Coarse implement of the paper "A Simultaneous Denoising and Dereverberation Framework with Target Decoupling", On DNS-2020 dataset, the DNSMOS of first stage is 3.42 and second stage is 3.47.

Lacmus is a cross-platform application that helps to find people who are lost in the forest using computer vision and neural networks.

基于pytorch构建cyclegan示例

Code for paper: Group-CAM: Group Score-Weighted Visual Explanations for Deep Convolutional Networks

Starter Code for VALUE benchmark

The code repository for EMNLP 2021 paper "Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization".

HTSeq is a Python library to facilitate processing and analysis of data from high-throughput sequencing (HTS) experiments.

Implementation of paper "Towards a Unified View of Parameter-Efficient Transfer Learning"