The code for the NeurIPS 2021 paper "A Unified View of cGANs with and without Classifiers".

Last update: May 28, 2022

Overview

Energy-based Conditional Generative Adversarial Network (ECGAN)

This is the code for the NeurIPS 2021 paper "A Unified View of cGANs with and without Classifiers". The repository is modified from StudioGAN. If you find our work useful, please consider citing the following paper:

@inproceedings{chen2021ECGAN,
  title   = {A Unified View of cGANs with and without Classifiers},
  author  = {Si-An Chen and Chun-Liang Li and Hsuan-Tien Lin},
  booktitle = {Advances in Neural Information Processing Systems},
  year    = {2021}
}

Please feel free to contact Si-An Chen if you have any questions about the code/paper.

Introduction

We propose a new Conditional Generative Adversarial Network (cGAN) framework called Energy-based Conditional Generative Adversarial Network (ECGAN) which provides a unified view of cGANs and achieves state-of-the-art results. We use the decomposition of the joint probability distribution to connect the goals of cGANs and classification as a unified framework. The framework, along with a classic energy model to parameterize distributions, justifies the use of classifiers for cGANs in a principled manner. It explains several popular cGAN variants, such as ACGAN, ProjGAN, and ContraGAN, as special cases with different levels of approximations. An illustration of the framework is shown below.

Requirements

Anaconda
Python >= 3.6
6.0.0 <= Pillow <= 7.0.0
scipy == 1.1.0 (Recommended for fast loading of Inception Network)
sklearn
seaborn
h5py
tqdm
torch >= 1.6.0 (Recommended for mixed precision training and knn analysis)
torchvision >= 0.7.0
tensorboard
5.4.0 <= gcc <= 7.4.0 (Recommended for proper use of adaptive discriminator augmentation module)

You can install the recommended environment as follows:

conda env create -f environment.yml -n studiogan

With docker, you can use:

docker pull mgkang/studiogan:0.1

Quick Start

Train (-t) and evaluate (-e) the model defined in CONFIG_PATH using GPU 0

CUDA_VISIBLE_DEVICES=0 python3 src/main.py -t -e -c CONFIG_PATH

Train (-t) and evaluate (-e) the model defined in CONFIG_PATH using GPUs (0, 1, 2, 3) and DataParallel

CUDA_VISIBLE_DEVICES=0,1,2,3 python3 src/main.py -t -e -c CONFIG_PATH

Try python3 src/main.py to see available options.

Dataset

CIFAR10: StudioGAN will automatically download the dataset once you execute main.py.
Tiny Imagenet, Imagenet, or a custom dataset:
1. download Tiny Imagenet and Imagenet. Prepare your own dataset.
2. make the folder structure of the dataset as follows:

┌── docs
├── src
└── data
    └── ILSVRC2012 or TINY_ILSVRC2012 or CUSTOM
        ├── train
        │   ├── cls0
        │   │   ├── train0.png
        │   │   ├── train1.png
        │   │   └── ...
        │   ├── cls1
        │   └── ...
        └── valid
            ├── cls0
            │   ├── valid0.png
            │   ├── valid1.png
            │   └── ...
            ├── cls1
            └── ...

Examples and Results

The src/configs directory contains config files used in our experiments.

CIFAR10 (3x32x32)

To train and evaluate ECGAN-UC on CIFAR10:

python3 src/main.py -t -e -c src/configs/CIFAR10/ecgan_v2_none_0_0p01.json

Method	Reference	IS(⭡)	FID(⭣)	F_1/8(⭡)	F_8(⭡)	Cfg	Log	Weights
BigGAN-Mod	StudioGAN	9.746	8.034	0.995	0.994	-	-	-
ContraGAN	StudioGAN	9.729	8.065	0.993	0.992	-	-	-
Ours	-	10.078	7.936	0.990	0.988	Cfg	Log	Link

Tiny ImageNet (3x64x64)

To train and evaluate ECGAN-UC on Tiny ImageNet:

python3 src/main.py -t -e -c src/configs/TINY_ILSVRC2012/ecgan_v2_none_0_0p01.json --eval_type valid

Method	Reference	IS(⭡)	FID(⭣)	F_1/8(⭡)	F_8(⭡)	Cfg	Log	Weights
BigGAN-Mod	StudioGAN	11.998	31.92	0.956	0.879	-	-	-
ContraGAN	StudioGAN	13.494	27.027	0.975	0.902	-	-	-
Ours	-	18.445	18.319	0.977	0.973	Cfg	Log	Link

ImageNet (3x128x128)

To train and evaluate ECGAN-UCE on ImageNet (~12 days on 8 NVIDIA V100 GPUs):

python3 src/main.py -t -e -l -sync_bn -c src/configs/ILSVRC2012/imagenet_ecgan_v2_contra_1_0p05.json --eval_type valid

Method	Reference	IS(⭡)	FID(⭣)	F_1/8(⭡)	F_8(⭡)	Cfg	Log	Weights
BigGAN	StudioGAN	28.633	24.684	0.941	0.921	-	-	-
ContraGAN	StudioGAN	25.249	25.161	0.947	0.855	-	-	-
Ours	-	80.685	8.491	0.984	0.985	Cfg	Log	Link

Generated Images

Here are some selected images generated by ECGAN.

The code for the NeurIPS 2021 paper "A Unified View of cGANs with and without Classifiers".

Related tags

Overview

Energy-based Conditional Generative Adversarial Network (ECGAN)

Introduction

Requirements

Quick Start

Dataset

Examples and Results

CIFAR10 (3x32x32)

Tiny ImageNet (3x64x64)

ImageNet (3x128x128)

Generated Images

Owner

sianchen

RobustVideoMatting and background composing in one model by using onnxruntime.

Duke Machine Learning Winter School: Computer Vision 2022

kapre: Keras Audio Preprocessors

《Geo Word Clouds》paper implementation

Pydantic models for pywttr and aiopywttr.

Supervised domain-agnostic prediction framework for probabilistic modelling

[ICCV21] Official implementation of the "Social NCE: Contrastive Learning of Socially-aware Motion Representations" in PyTorch.

The official implementation of "Rethink Dilated Convolution for Real-time Semantic Segmentation"

The tl;dr on a few notable transformer/language model papers + other papers (alignment, memorization, etc).

Automatic labeling, conversion of different data set formats, sample size statistics, model cascade

Nicely is a real-time Feedback and Intervention Program Depression is a prevalent issue across all age groups, socioeconomic classes, and cultural identities.

基于YoloX目标检测+DeepSort算法实现多目标追踪Baseline

A repository for the updated version of CoinRun used to collect MUGEN, a multimodal video-audio-text dataset.

This is the offical website for paper ''Category-consistent deep network learning for accurate vehicle logo recognition''

A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation

ARAE-Tensorflow for Discrete Sequences (Adversarially Regularized Autoencoder)

Evidential Softmax for Sparse Multimodal Distributions in Deep Generative Models

Code for KHGT model, AAAI2021

[ICRA 2022] CaTGrasp: Learning Category-Level Task-Relevant Grasping in Clutter from Simulation

Official implementation of ACMMM'20 paper 'Self-supervised Video Representation Learning Using Inter-intra Contrastive Framework'