Code to reproduce the results in the paper "Tensor Component Analysis for Interpreting the Latent Space of GANs".

Last update: Jun 17, 2022

Related tags

Deep Learning TCA-latent-space

Overview

Tensor Component Analysis for Interpreting the Latent Space of GANs

[ paper | project page ]

Code to reproduce the results in the paper "Tensor Component Analysis for Interpreting the Latent Space of GANs".

dependencies

Firstly, to install the required packages, please run:

$ pip install -r requirements.txt

Pretrained weights

To replicate the results in the paper, you'll need to first download the pre-trained weights. To do so, simply run this from the command line:

./download_weights.sh

Quantitative results

building the prediction matrices

To reproduce Fig. 5, one can then run the ./quant.ipynb notebook using the pre-computed classification scores (please see this notebook for more details).

manually computing predictions

To call the Microsoft Azure Face API to generate the predictions again from scratch, one can run the shell script in ./quant/classify.sh. Firstly however, you need to generate our synthetic images to classify, which we detail below.

Qualitative results

generating the images

Reproducing the qualitative results (i.e. in Fig. 6) involves generating synthetic faces and 3 edited versions with the 3 attributes of interest (hair colour, yaw, and pitch). To generate these images (which are also used for the quantitative results), simply run:

$ ./generate_quant_edits.sh

mode-wise edits

Manual edits along individual modes of the tensor are made by calling main.py with the --mode edit_modewise flag. For example, one can reproduce the images from Fig. 3 with:

$ python main.py --cp_rank 0 --tucker_ranks "4,4,4,512" --model_name pggan_celebahq1024 --penalty_lam 0.001 --resume_iters 1000
  --n_to_edit 10 \
  --mode edit_modewise \
  --attribute_to_edit male

multilinear edits

Edits achieved with the 'multilinear mixing' are achieved instead by loading the relevant weights and supplying the --mode edit_multilinear flag. For example, the images in Fig. 4 are generated with:

$ python main.py --cp_rank 0 --tucker_ranks "256,4,4,512" --model_name pggan_celebahq1024 --penalty_lam 0.001 --resume_iters 200000
  --n_to_edit 10 \
  --mode edit_multilinear \
  --attribute_to_edit thick

Please feel free to get in touch at: [email protected], where x=oldfield

credits

All the code in ./architectures/ and utils.py is directly imported from https://github.com/genforce/genforce, only lightly modified to support performing the forward pass through the models partially, and returning the intermediate tensors.

The structure of the codebase follows https://github.com/yunjey/stargan, and hence we use their code as a template to build off. For this reason, you will find small helper functions (e.g. the first few lines of main.py) are borrowed from the StarGAN codebase.

Code to reproduce the results in the paper "Tensor Component Analysis for Interpreting the Latent Space of GANs".

Related tags

Overview

Tensor Component Analysis for Interpreting the Latent Space of GANs

[ paper | project page ]

dependencies

Pretrained weights

Quantitative results

building the prediction matrices

manually computing predictions

Qualitative results

generating the images

mode-wise edits

multilinear edits

credits

Owner

James Oldfield

Adversarial Learning for Semi-supervised Semantic Segmentation, BMVC 2018

MQBench Quantization Aware Training with PyTorch

A simple image/video to Desmos graph converter run locally

Official Pytorch implementation for AAAI2021 paper (RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning)

Keyword-BERT: Keyword-Attentive Deep Semantic Matching

Code for the ECIR'22 paper "Evaluating the Robustness of Retrieval Pipelines with Query Variation Generators"

A library that allows for inference on probabilistic models

Colossal-AI: A Unified Deep Learning System for Large-Scale Parallel Training

A framework for joint super-resolution and image synthesis, without requiring real training data

An example to implement a new backbone with OpenMMLab framework.

Generative Handwriting using LSTM Mixture Density Network with TensorFlow

City Surfaces: City-scale Semantic Segmentation of Sidewalk Surfaces

Code for our SIGCOMM'21 paper "Network Planning with Deep Reinforcement Learning".

Automatically replace ONNX's RandomNormal node with Constant node.

Code to reproduce the results in "Visually Grounded Reasoning across Languages and Cultures", EMNLP 2021.

Implementation of Research Paper "Learning to Enhance Low-Light Image via Zero-Reference Deep Curve Estimation"

This is a yolo3 implemented via tensorflow 2.7

Piotr - IoT firmware emulation instrumentation for training and research

This is a TensorFlow implementation for C2-Rec

DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generative Transformers