The implementation of 'Image synthesis via semantic composition'.

Last update: Jan 06, 2023

Image Synthesis via Semantic Composition [Project Page]

by Yi Wang, Lu Qi, Ying-Cong Chen, Xiangyu Zhang, Jiaya Jia.

Introduction

This repository provides the implementation of our semantic image synthesis method from the ICCV 2021 paper 'Image Synthesis via Semantic Composition'.

Our framework

Usage

git clone https://github.com/dvlab-research/SCGAN.git
cd SCGAN/code

This code requires PyTorch 1.0 and Python 3+. Other dependencies can be installed with:

pip install -r requirements.txt

Dataset Preparation

Please refer to SPADE for detailed dataset preparation steps.

Testing

Download the pretrained models and put the folder containing the model weights in ./checkpoints.
Then produce images with the pretrained models:

python test.py --gpu_ids 0,1,2,3 --dataset_mode [dataset] --config config/scgan_[dataset]_test.yml --fid --gt [gt_path] --visual_n 1

For example,

python test.py --gpu_ids 0,1,2,3 --dataset_mode celeba --config config/scgan_celeba-test.yml --fid --gt /data/datasets/celeba --visual_n 1

Visual results are stored at ./results/scgan_[dataset]/ by default.
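
When evaluating several datasets, it can help to assemble the test command programmatically. The sketch below only rearranges the flags shown above. The helper name `build_test_command` and its defaults are illustrative and not part of the repository. It assumes the config file follows the `scgan_[dataset]_test.yml` template; the CelebA example above uses `scgan_celeba-test.yml`, so pass `config` explicitly when the file name differs.

```python
import shlex


def build_test_command(dataset, gt_path, gpu_ids=(0, 1, 2, 3), visual_n=1, config=None):
    """Return the test.py invocation as an argument list (hypothetical helper)."""
    if config is None:
        # Assumption: the config follows the scgan_[dataset]_test.yml template.
        config = f"config/scgan_{dataset}_test.yml"
    return [
        "python", "test.py",
        "--gpu_ids", ",".join(str(i) for i in gpu_ids),
        "--dataset_mode", dataset,
        "--config", config,
        "--fid",
        "--gt", gt_path,
        "--visual_n", str(visual_n),
    ]


# Reproduce the CelebA example, keeping the hyphenated config name as written.
cmd = build_test_command("celeba", "/data/datasets/celeba",
                         config="config/scgan_celeba-test.yml")
print(shlex.join(cmd))
```

The returned list can be passed directly to `subprocess.run(cmd, check=True)` from the `SCGAN/code` directory.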

Pretrained Models (to be updated)

Dataset	Download link
CelebAMask-HQ	Baidu Disk (Code: face)

Training

Use train.sh to train new models. Alternatively, specify training options in config/[config_file].yml.

Key operators

Our proposed dynamic computation units (spatial conditional convolution and normalization) are extended from conditionally parameterized convolutions [1]. We generalize the scalar condition into a spatial one and also apply these techniques to normalization.
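
To make the idea of a spatial condition concrete, here is a minimal, framework-free sketch. It is one possible reading of the description above, not the repository's implementation. CondConv [1] mixes K expert kernels with one scalar weight per expert; the sketch instead lets each pixel carry its own mixing weights. All names (`correlate_at`, `spatial_cond_conv`, `explicit_mixed_kernel_conv`) and the single-channel list-of-lists layout are assumptions made for illustration. How the routing weights are predicted from the semantic map is left out.

```python
import random


def correlate_at(image, kernel, r, c):
    """Response of a square `kernel` centred at pixel (r, c), with zero padding."""
    k = len(kernel) // 2
    h, w = len(image), len(image[0])
    total = 0.0
    for dr in range(-k, k + 1):
        for dc in range(-k, k + 1):
            rr, cc = r + dr, c + dc
            if 0 <= rr < h and 0 <= cc < w:
                total += kernel[dr + k][dc + k] * image[rr][cc]
    return total


def spatial_cond_conv(image, experts, routing):
    """Blend the expert responses with per-pixel weights routing[i][r][c].

    Convolution is linear in its weights, so this equals applying the
    per-pixel kernel W(r, c) = sum_i routing[i][r][c] * experts[i].
    """
    h, w = len(image), len(image[0])
    return [[sum(routing[i][r][c] * correlate_at(image, experts[i], r, c)
                 for i in range(len(experts)))
             for c in range(w)]
            for r in range(h)]


def explicit_mixed_kernel_conv(image, experts, routing):
    """Reference version: build the mixed kernel at every pixel, then apply it."""
    h, w = len(image), len(image[0])
    size = len(experts[0])
    out = []
    for r in range(h):
        row = []
        for c in range(w):
            mixed = [[sum(routing[i][r][c] * experts[i][a][b]
                          for i in range(len(experts)))
                      for b in range(size)]
                     for a in range(size)]
            row.append(correlate_at(image, mixed, r, c))
        out.append(row)
    return out


# Small random check that both formulations agree.
random.seed(0)
H, W, K = 4, 5, 3
image = [[random.random() for _ in range(W)] for _ in range(H)]
experts = [[[random.uniform(-1, 1) for _ in range(3)] for _ in range(3)]
           for _ in range(K)]
routing = [[[random.random() for _ in range(W)] for _ in range(H)]
           for _ in range(K)]
fast = spatial_cond_conv(image, experts, routing)
ref = explicit_mixed_kernel_conv(image, experts, routing)
print(max(abs(fast[r][c] - ref[r][c]) for r in range(H) for c in range(W)))
```

If every pixel shares the same routing weights, the per-pixel kernel becomes a single mixed kernel, which is the scalar-conditioned case of [1]. The blended form is the practical one in a deep learning framework, since the K expert convolutions can be evaluated once and then combined pixel by pixel.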

Citation

If our research is useful for you, please consider citing:

@inproceedings{wang2021image,
  title={Image Synthesis via Semantic Composition},
  author={Wang, Yi and Qi, Lu and Chen, Ying-Cong and Zhang, Xiangyu and Jia, Jiaya},
  booktitle={ICCV},
  year={2021}
}

Acknowledgements

This code is built upon SPADE, Imaginaire, and PyTorch-FID.

Reference

[1] Brandon Yang, Gabriel Bender, Quoc V. Le, and Jiquan Ngiam. CondConv: Conditionally parameterized convolutions for efficient inference. In NeurIPS, 2019.

Contact

Please send email to [email protected].
