Official Pytorch implementation of "DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network" (CVPR'21)

Last update: Nov 22, 2022

Related tags

Overview

DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network

Pytorch implementation for our DivCo. We propose a simple yet effective regularization term named latent-augmented contrastive loss that can be applied to arbitrary conditional generative adversarial networks in different tasks to alleviate the mode collapse issue and improve the diversity.

Contact: Rui Liu ([email protected])

Paper

DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network
Rui Liu, Yixiao Ge, Ching Lam Choi, Xiaogang Wang, and Hongsheng Li
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021
[arxiv]

Citing DivCo

If you find DivCo useful in your research, please consider citing:

@inproceedings{Liu_DivCo,
  author = {Liu, Rui and Ge, Yixiao and Choi, Ching Lam and Wang, Xiaogang and Li, Hongsheng},
  booktitle = {IEEE Conference on Computer Vision and Pattern Recognition},
  title = {DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network},
  year = {2021}
}

Framework

Usage

Prerequisites

Python >= 3.6
Pytorch >= 0.4.0 and corresponding torchvision (https://pytorch.org/)

Install

Clone this repo:

git clone https://github.com/ruiliu-ai/DivCo.git

Training Examples

Download datasets for each task into the dataset folder

mkdir datasets

Label-conditoned Image Generation

Dataset: CIFAR-10
Baseline: DCGAN

cd DivCo/DivCo-DCGAN
python train.py --dataroot ./datasets/Cifar10

Paried Image-to-image Translation

Paired Data: facades and maps
Baseline: BicycleGAN

You can download the facades and maps datasets from the BicycleGAN [Github Project].
We employ the network architecture of the BicycleGAN and follow its training process.

cd DivCo/DivCo-BicycleGAN
python train.py --dataroot ./datasets/facades

Unpaired Image-to-image Translation

Unpaired Data: Yosemite (summer <-> winter) and Cat2Dog (cat <-> dog)
Baseline: DRIT

You can download the datasets from the DRIT [Github Project].
Specify --concat 0 for Cat2Dog to handle large shape variation translation

cd DivCo/DivCo-DRIT
python train.py --dataroot ./datasets/cat2dog --concat 0 --lambda_contra 0.1
python train.py --dataroot ./datasets/yosemite --concat 1 --lambda_contra 1.0

Pre-trained Models

Download and save them into

./models/

Evaluation

For BicycleGAN, DRIT and MSGAN, please follow the instructions of corresponding github projects of the baseline frameworks for more evaluation details.

Testing Examples

DivCo-DCGAN

python test.py --dataroot ./datasets/Cifar10 --resume ./models/DivCo-DCGAN/00199.pth

DivCo-BicycleGAN

python test.py --dataroot ./datasets/facades --checkpoints_dir ./models/DivCo-BicycleGAN/facades --epoch 400

python test.py --dataroot ./datasets/maps --checkpoints_dir ./models/DivCo-BicycleGAN/maps --epoch 400

DivCo-DRIT

python test.py --dataroot ./datasets/yosemite --resume ./models/DivCo-DRIT/yosemite/01199.pth --concat 1

python test.py --dataroot ./datasets/cat2dog --resume ./models/DivCo-DRIT/cat2dog/01199.pth --concat 0

Official Pytorch implementation of "DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network" (CVPR'21)

Related tags

Overview

DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network

Paper

Citing DivCo

Framework

Usage

Prerequisites

Install

Training Examples

Label-conditoned Image Generation

Paried Image-to-image Translation

Unpaired Image-to-image Translation

Pre-trained Models

Evaluation

Testing Examples

Reference

Quantitative Evaluation Metrics

Owner

This repository provides the official code for GeNER (an automated dataset Generation framework for NER).

Official implementation of the NeurIPS'21 paper 'Conditional Generation Using Polynomial Expansions'.

SMCA replication There are no extra compiled components in SMCA DETR and package dependencies are minimal

WRENCH: Weak supeRvision bENCHmark

Code for the paper "Learning-Augmented Algorithms for Online Steiner Tree"

Data labels and scripts for fastMRI.org

Direct application of DALLE-2 to video synthesis, using factored space-time Unet and Transformers

For the paper entitled ''A Case Study and Qualitative Analysis of Simple Cross-Lingual Opinion Mining''

This repository contains the source code for the paper Tutorial on amortized optimization for learning to optimize over continuous domains by Brandon Amos

Code for PackNet: Adding Multiple Tasks to a Single Network by Iterative Pruning

Analysis of Smiles through reservoir sampling & RDkit

Deep Learning Pipelines for Apache Spark

Ludwig is a toolbox that allows to train and evaluate deep learning models without the need to write code.

Towards uncontrained hand-object reconstruction from RGB videos

Official implementation of the paper: "LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech"

Real-time face detection and emotion/gender classification using fer2013/imdb datasets with a keras CNN model and openCV.

Official implementation of "Watermarking Images in Self-Supervised Latent-Spaces"

Cooperative multi-agent reinforcement learning for high-dimensional nonequilibrium control

《DeepViT: Towards Deeper Vision Transformer》(2021)

The pyrelational package offers a flexible workflow to enable active learning with as little change to the models and datasets as possible