ArcaneGAN by Alex Spirin

Last update: Dec 28, 2022

Related tags

Deep Learning ArcaneGAN

Overview

ArcaneGAN by Alex Spirin

Changelog

2021-12-12 ArcaneGAN v0.3 is live
2021-12-09 Thanks to ak92501 we now have a huggingface demo

ArcaneGAN v0.3

Videos processed by the huggingface video inference colab.

obama2.mp4

ryan2.mp4

Image samples

Faces were enhanced via GPEN before applying the ArcaneGAN v0.3 filter.

ArcaneGAN v0.2

The release is here

Implementation Details

It does something, but not much at the moment.

The model is a pytroch *.jit of a fastai v1 flavored u-net trained on a paired dataset, generated via a blended stylegan2. You can see the blending colab I've used here.

Comments

How to convert the FastAI model to Pytorch JIT

Hi,

I trained a model with unet_learner but I can't convert it to jit.

I run the following code: torch.jit.save(torch.jit.script(learn.model), 'jit.pt')

Here is the error:

UnsupportedNodeError: GeneratorExp aren't supported: File "/usr/local/lib/python3.7/dist-packages/fastai/callbacks/hooks.py", line 21 "Applieshook_functomodule,input,output." if self.detach: input = (o.detach() for o in input ) if is_listy(input ) else input.detach() ~ <--- HERE output = (o.detach() for o in output) if is_listy(output) else output.detach() self.stored = self.hook_func(module, input, output)

May I know how you convert it to a jit model? Thanks

opened by ramtiin 2
Ошибка

Добрый вечер.В ArcaneGAN на colab for videos,выдаёт ошибку:

RuntimeError: CUDA out of memory. Tried to allocate 2.80 GiB (GPU 0; 11.17 GiB total capacity; 5.74 GiB already allocated; 2.21 GiB free; 8.44 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF

Помогите пожалуйста!

opened by Zzip7 2
How do you change the style of the whole image

Nice work! My only confusion is how you change the style of the whole image instead of just the face. Usually, StyleGAN generates aligned face images by fine-tuning the FFHQ checkpoint. How does the pix2pix model trained with these face image pairs work with the full image or frame.

opened by zhanglonghao1992 2
Architecture for video

Hi, what does the architecture look like? Is it similar to Pix2Pix? And for processing of the video, are you doing anything extra to make sure the frames are consistent?

opened by unography 2
How to prevent eyes occur in nose?

Hello, I try your model and it's amazing, but I find in some pictures if the nose is too big, there will be eyes in the nose. I try to lower the 'target_face' and it can work. But the details like the light of the eyes and background will also lose when I lower the 'target_face'. So I wonder is there a way to prevent the eyes occurs in the nose and keep the details in the meantime?

opened by Folkfive 1
support arbitrary image size?

Great work!

The unet prediction result will be cropped to be the same size as the training input, e.g. 256 or 512. For arbitrary image size (e.g. 1280*720), how to config or set the model to output the same size of the input image as your colab did? Thank you.

opened by foobarhe 1
RuntimeError: CUDA out of memory

Добрый вечер.Извините,это опять я.Снова эта ошибка появляется.Можно ли,самому эту ошибку решать?Или исправлять можете только вы?Обьясните пожалуйста подробно.

opened by Zzip7 1
about the paired datasets generated by stylegan

how do you make sure the background and expression similarity between the generated input(face) and target(style face) ? I find that the style is too weak when less finetune and the similarity is too weak when more finetune, how do you solve it ? Would you like to share the paired datasets generated code with me ? thanks a lot ~

opened by Leocien 1
Any news for training code?

Interesting topic... I wonder how you trained the model, especially the augmentation part. Fixed crop limitation is a well-known problem and would like to know how you handle it. :)

opened by dongyun-kim-arch 0
tuple issue

Was trying the ArcaneGan video colab but I am having a tuple issue can you please help, i am really excited to try the Arcane video can you please help out

opened by mau021 0
What GPU is used for training?

Hi,

I want to train the Fastai u-net model. However, when I try to train the critic (learn_critic.fit_one_cycle(6, 1e-3)), I get the following error:

CUDA out of memory. Tried to allocate 4.00 GiB (GPU 0; 14.76 GiB total capacity; 9.78 GiB already allocated; 891.75 MiB free; 12.57 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF

The GPU is a Tesla T4 with 16 GB of VRAM. My batch size is 4 and the training images size is 512*512. I also tried with lower numbers, but I'm still getting the same error.

opened by ramtiin 2
How to make the style stronger?

The following are input image, my training output from pair label supervision, and the output from your test model。 I trained my model (Super-Resolution model) on the images from your model outputs, I find it difficult to change the facial features。 Like the eyes and face texture are changed, how to do it ? I use L1Loss (weight is 1) + PerceptualLoss (weight is 1)+ GANLoss (weight is 0.1),

opened by xuanandsix 1

Releases(v0.4)

v0.4(Dec 25, 2021)
ArcaneGAN v0.4

The main differences are:

lighter styling (closer to original input)

sharper result

happier faces

reduced childish eyes effect

reduced stubble on feminine faces

increased temporal stability on videos

reduced mouth\teeth artifacts

Image samples

v0.3 vs v0.4

Video samples

https://user-images.githubusercontent.com/11751592/146966428-f4e27929-19dd-423f-a772-8aee709d2116.mp4

https://user-images.githubusercontent.com/11751592/146966462-6511998e-77f5-4fd2-8ad9-5709bf0cd172.mp4
Source code(tar.gz)
Source code(zip)
ArcaneGANv0.4.jit(59.75 MB)
v0.3(Dec 12, 2021)

ArcaneGAN v0.3

Video samples

This is a stronger-styled version. It performs okay on videos, though visible flickering is present. Here are some video examples.

https://user-images.githubusercontent.com/11751592/145702737-c02b8b00-ad30-4358-98bf-97c8ad7fefdf.mp4

https://user-images.githubusercontent.com/11751592/145702740-afd3377d-d117-467d-96ca-045e25d85ac6.mp4

Image samples

Faces were enhanced via GPEN before applying the ArcaneGAN v0.3 filter.

The model is a pytroch *.jit of a fastai v1 flavored u-net trained on a paired dataset, generated via a blended stylegan2. You can see the blending colab I've used here.
Source code(tar.gz)
Source code(zip)
ArcaneGANv0.3.jit(79.40 MB)
v0.2(Dec 7, 2021)

ArcaneGAN v0.2 This version is a bit better at doing something other than making images darker :D

Here are some image pairs. I've specifically picked various images to see how the model performs in the wild, not on aligned and cropped faces.

The model is a pytroch *.jit of a fastai v1 flavored u-net trained on a paired dataset, generated via a blended stylegan2. You can see the blending colab I've used here.

Inference notebook is here
Source code(tar.gz)
Source code(zip)
ArcaneGANv0.2.jit(79.52 MB)
v0.1(Dec 6, 2021)

ArcaneGAN v0.1 This is a proof of concept release. The model is in beta (which means it's beta than nothin')

Here are some image pairs. I've specifically picked various images to see how the model performs in the wild, not on aligned and cropped faces.

It does something, but not much at the moment.

The model is a pytroch *.jit of a fastai v1 flavored u-net trained on a paired dataset, generated via a blended stylegan2. You can see the blending colab I've used here.

Inference notebook is here
Source code(tar.gz)
Source code(zip)
ArcaneGANv0.1.jit(79.53 MB)

Owner

Alex

GitHub Repository

Sentiment analysis translations of the Bhagavad Gita

Sentiment and Semantic Analysis of Bhagavad Gita Translations It is well known that translations of songs and poems not only breaks rhythm and rhyming

3 Aug 01, 2022

Cascading Feature Extraction for Fast Point Cloud Registration (BMVC 2021)

Cascading Feature Extraction for Fast Point Cloud Registration This repository contains the source code for the paper [Arxive link comming soon]. Meth

7 May 26, 2022

A python script to dump all the challenges locally of a CTFd-based Capture the Flag.

A python script to dump all the challenges locally of a CTFd-based Capture the Flag. Features Connects and logins to a remote CTFd instance. Dumps all

77 Dec 07, 2022

Range Image-based LiDAR Localization for Autonomous Vehicles Using Mesh Maps

Range Image-based 3D LiDAR Localization This repo contains the code for our ICRA2021 paper: Range Image-based LiDAR Localization for Autonomous Vehicl

208 Dec 15, 2022

FuseDream: Training-Free Text-to-Image Generationwith Improved CLIP+GAN Space OptimizationFuseDream: Training-Free Text-to-Image Generationwith Improved CLIP+GAN Space Optimization

FuseDream This repo contains code for our paper (paper link): FuseDream: Training-Free Text-to-Image Generation with Improved CLIP+GAN Space Optimizat

191 Dec 31, 2022

An open source machine learning library for performing regression tasks using RVM technique.

Introduction neonrvm is an open source machine learning library for performing regression tasks using RVM technique. It is written in C programming la

33 May 31, 2022

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

AlphaZero-Gomoku This is an implementation of the AlphaZero algorithm for playing the simple board game Gomoku (also called Gobang or Five in a Row) f

2.8k Dec 26, 2022

PyTorch version implementation of DORN

DORN_PyTorch This is a PyTorch version implementation of DORN Reference H. Fu, M. Gong, C. Wang, K. Batmanghelich and D. Tao: Deep Ordinal Regression

3 Apr 27, 2022

A Pytorch implement of paper "Anomaly detection in dynamic graphs via transformer" (TADDY).

TADDY: Anomaly detection in dynamic graphs via transformer This repo covers an reference implementation for the paper "Anomaly detection in dynamic gr

21 Nov 24, 2022

Some code of the implements of Geological Modeling Using 3D Pixel-Adaptive and Deformable Convolutional Neural Network

3D-GMPDCNN Geological Modeling Using 3D Pixel-Adaptive and Deformable Convolutional Neural Network PyTorch implementation of "Geological Modeling Usin

5 Nov 21, 2022

RepVGG: Making VGG-style ConvNets Great Again

This repository is the code that needs to be submitted for OpenMMLab Algorithm Ecological Challenge，the paper is RepVGG: Making VGG-style ConvNets Great Again

62 May 21, 2022

Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning. CVPR 2018

Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning Tensorflow code and models for the paper: Large Scale Fine-Grained Categ

187 Oct 01, 2022

Deep Learning GPU Training System

DIGITS DIGITS (the Deep Learning GPU Training System) is a webapp for training deep learning models. The currently supported frameworks are: Caffe, To

4.1k Jan 03, 2023

Non-Homogeneous Poisson Process Intensity Modeling and Estimation using Measure Transport

Non-Homogeneous Poisson Process Intensity Modeling and Estimation using Measure Transport This GitHub page provides code for reproducing the results i

1 Nov 08, 2021

OneFlow is a performance-centered and open-source deep learning framework.

OneFlow OneFlow is a performance-centered and open-source deep learning framework. Latest News Version 0.5.0 is out! First class support for eager exe

4.2k Jan 07, 2023

Pre-trained BERT Models for Ancient and Medieval Greek, and associated code for LaTeCH 2021 paper titled - "A Pilot Study for BERT Language Modelling and Morphological Analysis for Ancient and Medieval Greek"

Ancient Greek BERT The first and only available Ancient Greek sub-word BERT model! State-of-the-art post fine-tuning on Part-of-Speech Tagging and Mor

22 Dec 08, 2022

Predicting path with preference based on user demonstration using Maximum Entropy Deep Inverse Reinforcement Learning in a continuous environment

Preference-Planning-Deep-IRL Introduction Check my portfolio post Dependencies Gym stable-baselines3 PyTorch Usage Take Demonstration python3 record.

9 Oct 26, 2022

ArcaneGAN by Alex Spirin

Related tags

Overview

ArcaneGAN by Alex Spirin

ArcaneGAN v0.3

Image samples

ArcaneGAN v0.2

Implementation Details

Comments

Releases(v0.4)

v0.4(Dec 25, 2021)

ArcaneGAN v0.4

Image samples

Video samples

v0.3(Dec 12, 2021)

ArcaneGAN v0.3

Video samples

Image samples

v0.2(Dec 7, 2021)

v0.1(Dec 6, 2021)

Owner

Alex

Sentiment analysis translations of the Bhagavad Gita

Cascading Feature Extraction for Fast Point Cloud Registration (BMVC 2021)

A python script to dump all the challenges locally of a CTFd-based Capture the Flag.

Range Image-based LiDAR Localization for Autonomous Vehicles Using Mesh Maps

FuseDream: Training-Free Text-to-Image Generationwith Improved CLIP+GAN Space OptimizationFuseDream: Training-Free Text-to-Image Generationwith Improved CLIP+GAN Space Optimization

An open source machine learning library for performing regression tasks using RVM technique.

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

PyTorch version implementation of DORN

A Pytorch implement of paper "Anomaly detection in dynamic graphs via transformer" (TADDY).

Some code of the implements of Geological Modeling Using 3D Pixel-Adaptive and Deformable Convolutional Neural Network

RepVGG: Making VGG-style ConvNets Great Again

Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning. CVPR 2018

Deep Learning GPU Training System

Non-Homogeneous Poisson Process Intensity Modeling and Estimation using Measure Transport

OneFlow is a performance-centered and open-source deep learning framework.

Pre-trained BERT Models for Ancient and Medieval Greek, and associated code for LaTeCH 2021 paper titled - "A Pilot Study for BERT Language Modelling and Morphological Analysis for Ancient and Medieval Greek"

Predicting path with preference based on user demonstration using Maximum Entropy Deep Inverse Reinforcement Learning in a continuous environment

Pytorch implementation of "Training a 85.4% Top-1 Accuracy Vision Transformer with 56M Parameters on ImageNet"

Library for time-series-forecasting-as-a-service.

Official implementation of Deep Reparametrization of Multi-Frame Super-Resolution and Denoising