TensorFlow CNN for fast style transfer

Last update: Dec 14, 2021

Overview

Fast Style Transfer in TensorFlow

Add styles from famous paintings to any photo in a fraction of a second!

It takes 100ms on a 2015 Titan X to style the MIT Stata Center (1024×680) like Udnie, by Francis Picabia.

Our implementation is based on a combination of Gatys' A Neural Algorithm of Artistic Style, Johnson's Perceptual Losses for Real-Time Style Transfer and Super-Resolution, and Ulyanov's Instance Normalization. The repository is based on https://github.com/lengstrom/fast-style-transfer.git.

Image Stylization

We added styles from various paintings to a photo of Chicago.

Implementation Details

Our implementation uses TensorFlow to train a fast style transfer network. We use roughly the same transformation network as described in Johnson, except that batch normalization is replaced with Ulyanov's instance normalization, and the scaling/offset of the output tanh layer is slightly different. We use a loss function close to the one described in Gatys, using VGG19 instead of VGG16 and typically using "shallower" layers than in Johnson's implementation (e.g. we use relu1_1 rather than relu1_2). Empirically, this results in larger scale style features in transformations.
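The two ingredients named above, instance normalization and a Gatys-style loss built on feature correlations, can be illustrated with a small NumPy sketch. This is not the repository's code: the function names, the epsilon value, and the Gram matrix normalization (dividing by height × width × channels) are assumptions for illustration, and the learned scale/offset that usually follows instance normalization is omitted.

```python
import numpy as np


def instance_norm(x, eps=1e-5):
    """Normalize every (image, channel) feature map over its spatial dims.

    x has shape (batch, height, width, channels). Unlike batch
    normalization, the statistics are computed per image, not per batch.
    eps is an assumed small constant that avoids division by zero.
    """
    mean = x.mean(axis=(1, 2), keepdims=True)
    var = x.var(axis=(1, 2), keepdims=True)
    return (x - mean) / np.sqrt(var + eps)


def gram_matrix(features):
    """Channel-by-channel correlations of one feature map.

    features has shape (height, width, channels); the result has shape
    (channels, channels). The normalization constant is an assumption.
    """
    h, w, c = features.shape
    flat = features.reshape(h * w, c)
    return np.dot(flat.T, flat) / float(h * w * c)


def style_loss(generated, style):
    """Squared difference between the Gram matrices of two feature maps."""
    diff = gram_matrix(generated) - gram_matrix(style)
    return float(np.sum(diff ** 2))
```

In a full pipeline the feature maps would come from VGG19 layers such as relu1_1, and the style loss would be summed over several layers; here any array of the right shape can stand in for them.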

Virtual Environment Setup (Anaconda) - Windows/Linux

Tested on

Spec
Operating System	Windows 10 Home
GPU	NVIDIA GeForce RTX 2080 Ti
CUDA Version	11.0
Driver Version	445.75

Step 1: Install Anaconda

https://docs.anaconda.com/anaconda/install/

Step 2: Build a virtual environment

Run the following commands in sequence in Anaconda Prompt:

conda create -n tf-gpu tensorflow-gpu=2.1.0
conda activate tf-gpu
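Once the environment is active, a quick check that TensorFlow can see the GPU might look like the following. This is a minimal sketch, assuming TensorFlow 2.1 or later (where tf.config.list_physical_devices is available); the helper name visible_gpus is hypothetical and not part of the repository.

```python
def visible_gpus():
    """Return the names of GPUs TensorFlow can see, or None if
    TensorFlow is not importable in the current environment."""
    try:
        import tensorflow as tf
    except ImportError:
        return None
    return [device.name for device in tf.config.list_physical_devices("GPU")]
```

Calling print(visible_gpus()) inside the tf-gpu environment should print a non-empty list when the driver and CUDA libraries are set up correctly; an empty list suggests TensorFlow is falling back to the CPU.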

Next, install moviepy. Run the following command in a notebook cell (from Anaconda Prompt, omit the leading "!"), or install the package with conda instead:

!pip install moviepy==1.0.2

Follow the commands below to use fast-style-transfer

Documentation

Training Style Transfer Networks

Use style.py to train a new style transfer network. Run python style.py to view all the possible parameters. Training takes 4-6 hours on a Maxwell Titan X. More detailed documentation here. Before you run this, you should run setup.sh. Example usage:

python main.py --style path/to/style/img.jpg \
  --checkpoint-dir checkpoint/path \
  --test path/to/test/img.jpg \
  --test-dir path/to/test/dir \
  --content-weight 1.5e1 \
  --checkpoint-iterations 1000 \
  --batch-size 20
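When training several styles in a row, it can be convenient to assemble the command above programmatically. The sketch below only builds the argument list from the flags shown in the example; the helper name build_train_command and the default script name are assumptions taken from that example, not a documented API.

```python
def build_train_command(style_image, checkpoint_dir, test_image, test_dir,
                        content_weight=1.5e1, checkpoint_iterations=1000,
                        batch_size=20, script="main.py"):
    """Build the training command shown above as an argument list.

    Defaults mirror the example usage; the script name is an assumption
    and should match whatever the training script is called locally.
    """
    return [
        "python", script,
        "--style", style_image,
        "--checkpoint-dir", checkpoint_dir,
        "--test", test_image,
        "--test-dir", test_dir,
        "--content-weight", str(content_weight),
        "--checkpoint-iterations", str(checkpoint_iterations),
        "--batch-size", str(batch_size),
    ]
```

The resulting list can be passed to subprocess.run(cmd, check=True) on Python 3.5 or later, using a separate checkpoint directory per style so that runs do not overwrite each other.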

Evaluating Style Transfer Networks

Use evaluate.py to evaluate a style transfer network. Run python evaluate.py to view all the possible parameters. Evaluation takes 100 ms per frame (when batch size is 1) on a Maxwell Titan X and several seconds per frame on a CPU. More detailed documentation here. Models for evaluation are located here. Example usage:

python eval.py --checkpoint path/to/style/model.ckpt \
  --in-path dir/of/test/imgs/ \
  --out-path dir/for/results/
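The per-frame timings quoted above translate directly into a rough estimate for a whole video. The sketch below is plain arithmetic under stated assumptions: the function name is hypothetical, the 100 ms default is the Maxwell Titan X figure from the text, and any CPU figure passed in is a placeholder for "several seconds".

```python
def estimated_stylize_seconds(duration_s, fps, per_frame_ms=100):
    """Estimate total stylization time for a clip.

    duration_s: clip length in seconds
    fps: frames per second of the clip
    per_frame_ms: time to stylize one frame, in milliseconds
    """
    frames = int(round(duration_s * fps))
    return frames * per_frame_ms / 1000.0
```

For example, a 60 second clip at 30 frames per second has 1800 frames, so estimated_stylize_seconds(60, 30) gives 180 seconds at 100 ms per frame; at an assumed 3000 ms per frame on a CPU the same clip would take 5400 seconds.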

Requirements

You will need the following to run the above:

- TensorFlow 0.11.0
- Python 2.7.9, Pillow 3.4.2, scipy 0.18.1, numpy 1.11.2
- If you want to train (and don't want to wait for 4 months):
  - A decent GPU
  - All the required NVIDIA software to run TF on a GPU (cuda, etc)
- ffmpeg 3.1.3 if you want to stylize video
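Because the versions listed here differ from the tensorflow-gpu 2.1.0 environment created in the setup section, it can help to print what is actually installed before running anything. The following is a minimal sketch; the helper name installed_versions is hypothetical, and it assumes each package exposes a __version__ attribute (packages that do not are reported as "unknown").

```python
import importlib


def installed_versions(module_names):
    """Map each module name to its version string.

    A module that cannot be imported maps to None; a module without a
    __version__ attribute maps to "unknown".
    """
    versions = {}
    for name in module_names:
        try:
            module = importlib.import_module(name)
        except ImportError:
            versions[name] = None
            continue
        versions[name] = getattr(module, "__version__", "unknown")
    return versions
```

Calling installed_versions(["tensorflow", "PIL", "scipy", "numpy"]) inside the environment gives a dictionary that can be compared against the list above (Pillow is imported under the name PIL).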
