[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.

Last update: Dec 30, 2022

Overview

[CVPR2022] Thin-Plate Spline Motion Model for Image Animation

Source code of the CVPR'2022 paper "Thin-Plate Spline Motion Model for Image Animation"

Example animation

PS: The paper trains the model for 100 epochs for a fair comparison. You can use more data and train for more epochs to get better performance.

Web demo for animation

Try the web demo for animation here:
Google Colab:

Pre-trained models

Installation

We support python3.(Recommended version is Python 3.9). To install the dependencies run:

pip install -r requirements.txt

YAML configs

There are several configuration files one for each dataset in the config folder named as config/dataset_name.yaml.

See description of the parameters in the config/taichi-256.yaml.

Datasets

MGif. Follow Monkey-Net.
TaiChiHD and VoxCeleb. Follow instructions from video-preprocessing.
TED-talks. Follow instructions from MRAA.

Training

To train a model on specific dataset run:

CUDA_VISIBLE_DEVICES=0,1 python run.py --config config/dataset_name.yaml --device_ids 0,1

A log folder named after the timestamp will be created. Checkpoints, loss values, reconstruction results will be saved to this folder.

Training AVD network

To train a model on specific dataset run:

CUDA_VISIBLE_DEVICES=0 python run.py --mode train_avd --checkpoint '{checkpoint_folder}/checkpoint.pth.tar' --config config/dataset_name.yaml

Checkpoints, loss values, reconstruction results will be saved to {checkpoint_folder}.

Evaluation on video reconstruction

To evaluate the reconstruction performance run:

CUDA_VISIBLE_DEVICES=0 python run.py --mode reconstruction --config config/dataset_name.yaml --checkpoint '{checkpoint_folder}/checkpoint.pth.tar'

The reconstruction subfolder will be created in {checkpoint_folder}. The generated video will be stored to this folder, also generated videos will be stored in png subfolder in loss-less '.png' format for evaluation. To compute metrics, follow instructions from pose-evaluation.

Image animation demo

notebook: demo.ipynb, edit the config cell and run for image animation.
python:

CUDA_VISIBLE_DEVICES=0 python demo.py --config config/vox-256.yaml --checkpoint checkpoints/vox.pth.tar --source_image ./source.jpg --driving_video ./driving.mp4

Acknowledgments

The main code is based upon FOMM and MRAA

Thanks for the excellent works!

Thanks iperov, this work has been integrated in DeepFaceLive

[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.

Related tags

Overview

[CVPR2022] Thin-Plate Spline Motion Model for Image Animation

Example animation

Web demo for animation

Pre-trained models

Installation

YAML configs

Datasets

Training

Training AVD network

Evaluation on video reconstruction

Image animation demo

Acknowledgments

Owner

yoyo-nb

MLP-Numpy - A simple modular implementation of Multi Layer Perceptron in pure Numpy.

Deep Learning and Logical Reasoning from Data and Knowledge

Multiple custom object count and detection using YOLOv3-Tiny method

Implementation for paper LadderNet: Multi-path networks based on U-Net for medical image segmentation

Improving the robustness and performance of biomedical NLP models through adversarial training

Serverless proxy for Spark cluster

Using Machine Learning to Create High-Res Fine Art

StocksMA is a package to facilitate access to financial and economic data of Moroccan stocks.

Connecting Java/ImgLib2 + Python/NumPy

Official implementation of Deep Convolutional Dictionary Learning for Image Denoising.

Simple, but essential Bayesian optimization package

Code for the paper "Functional Regularization for Reinforcement Learning via Learned Fourier Features"

CATE: Computation-aware Neural Architecture Encoding with Transformers

[Preprint] "Chasing Sparsity in Vision Transformers: An End-to-End Exploration" by Tianlong Chen, Yu Cheng, Zhe Gan, Lu Yuan, Lei Zhang, Zhangyang Wang

SimDeblur is a simple framework for image and video deblurring, implemented by PyTorch

Deep learning library featuring a higher-level API for TensorFlow.

Flappy bird automation using Neuroevolution of Augmenting Topologies (NEAT) in Python

Official code for the paper "Self-Supervised Prototypical Transfer Learning for Few-Shot Classification"

A general-purpose, flexible, and easy-to-use simulator alongside an OpenAI Gym trading environment for MetaTrader 5 trading platform (Approved by OpenAI Gym)

Graduation Project