DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editing

Last update: Dec 03, 2022

Overview

Figure: Joint multi-attribute edits using the DyStyle model.

Unconditional GAN frameworks such as StyleGAN and its variants achieve great diversity and photorealism. Meanwhile, persistent efforts have been made to enhance the semantic controllability of StyleGANs. For example, a dozen style manipulation methods have recently been proposed to perform attribute-conditioned style editing. Although some of these methods work well when manipulating the style codes along one attribute, their control accuracy tends to be problematic when multiple attributes are manipulated jointly. To address these limitations, we propose a Dynamic Style Manipulation Network (DyStyle), whose structure and parameters vary with the input sample, to perform nonlinear and adaptive manipulation of latent codes for flexible and precise attribute control. Additionally, a novel easy-to-hard training procedure is introduced for efficient and stable training of the DyStyle network. Extensive experiments have been conducted on faces and other objects. Our approach demonstrates fine-grained, disentangled edits along multiple numeric and binary attributes. Qualitative and quantitative comparisons with existing style manipulation methods verify the superiority of our method in terms of attribute control accuracy and identity preservation, without compromising photorealism. The advantage of our method is even more significant for joint multi-attribute control.
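The idea of a network whose structure varies with the input can be illustrated with a toy sketch. This is not the DyStyle architecture: the `EXPERTS` table, the weights, the 4-dimensional latent code, and the `edit_latent` function are all hypothetical and chosen only to show that the computation performed depends on which attributes are requested.

```python
import math

# Hypothetical per-attribute "experts" (illustrative only, not the DyStyle model).
# Each expert maps (latent code, target value) to an offset on the latent code.
def make_expert(weights):
    def expert(latent, target):
        # Nonlinear offset that depends on both the current code and the target.
        return [math.tanh(w * target + 0.1 * z) for w, z in zip(weights, latent)]
    return expert

EXPERTS = {
    "yaw": make_expert([0.5, 0.0, -0.2, 0.1]),
    "age": make_expert([0.0, 0.3, 0.4, -0.1]),
    "glasses": make_expert([0.2, -0.3, 0.0, 0.6]),
}

def edit_latent(latent, targets):
    """Apply only the experts whose attributes appear in `targets`.

    Attributes that are not requested contribute no computation at all,
    so the executed "structure" changes from one input to the next.
    """
    edited = list(latent)
    for name, value in targets.items():
        offset = EXPERTS[name](latent, value)
        edited = [e + o for e, o in zip(edited, offset)]
    return edited

latent = [0.0, 1.0, -1.0, 0.5]
unchanged = edit_latent(latent, {})                        # no expert runs
joint = edit_latent(latent, {"yaw": 1.0, "glasses": 1.0})  # two experts run
```

With an empty `targets` dictionary the latent code is returned untouched, while a joint edit runs exactly the two requested experts.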

[paper]

Demo

Single Attribute Edits

Examples of editing facial expressions on real faces.

Examples of editing pupil color, hair color, mouth size, and hair length on anime faces.

Examples of editing eyes, mouth, yaw, and age on cat faces.

Multiple Attribute Edits

Examples of editing both yaw and glasses on real faces.

Images before editing

Images after editing

Real photo editing: photos are reconstructed with pSp and then edited with DyStyle.

Installation

Clone this repo.
This code requires PyTorch and Python 3+. Install the dependencies with:

conda env create -f environment.yml
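After creating and activating the environment, a quick import check can confirm that the main dependency is visible to the interpreter. The snippet below is a minimal sketch using only the standard library; the helper name `missing_packages` is hypothetical, and `torch` is the only package checked because it is the only one this README names.

```python
import importlib.util

def missing_packages(required=("torch",)):
    """Return the names in `required` that cannot be found by the import system."""
    return [name for name in required if importlib.util.find_spec(name) is None]

missing = missing_packages()
if missing:
    print("Missing packages: " + ", ".join(missing))
else:
    print("All required packages are importable.")
```

`importlib.util.find_spec` only locates the package without importing it, so the check is fast and does not initialize any GPU state.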

Editing Images with Pretrained Model

Before editing images, prepare the checkpoint, the GAN generator, and the edit config.

Then run the script for the domain you want to edit:

sh run_test_adult.sh [device_id]
sh run_test_anime.sh [device_id]
sh run_test_cat.sh [device_id]
sh run_test_dog.sh [device_id]
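To run the scripts for every domain in sequence, a small wrapper can build the commands from the naming pattern `run_<mode>_<domain>.sh`. This is a sketch rather than part of the repository: `build_command` and `run_all` are hypothetical helpers, and the wrapper defaults to a dry run that only prints the commands. The same pattern covers the training scripts.

```python
import subprocess

DOMAINS = ("adult", "anime", "cat", "dog")
MODES = ("test", "train")

def build_command(mode, domain, device_id):
    """Build the argument list for one script, e.g. sh run_test_cat.sh 0."""
    if mode not in MODES:
        raise ValueError(f"unknown mode: {mode}")
    if domain not in DOMAINS:
        raise ValueError(f"unknown domain: {domain}")
    return ["sh", f"run_{mode}_{domain}.sh", str(device_id)]

def run_all(mode, device_id, dry_run=True):
    """Print (dry run) or execute the script for every domain, in order."""
    commands = [build_command(mode, d, device_id) for d in DOMAINS]
    for cmd in commands:
        if dry_run:
            print(" ".join(cmd))
        else:
            # check=True stops at the first script that exits with an error.
            subprocess.run(cmd, check=True)
    return commands

# Dry run: prints the four test commands for device 0 without executing them.
run_all("test", 0)
```

Pass `dry_run=False` from the repository root to actually execute the scripts.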

Training DyStyle Model

Before training the DyStyle model, prepare the GAN generator, the attribute classifier, and the model config.

Then run the script for the domain you want to train on:

sh run_train_adult.sh [device_id]
sh run_train_anime.sh [device_id]
sh run_train_cat.sh [device_id]
sh run_train_dog.sh [device_id]

License

Citation

To be updated.
