Official PaddlePaddle implementation of Paint Transformer

Last update: Dec 31, 2022

Related tags

Deep Learning PaintTransformer

Overview

Paint Transformer: Feed Forward Neural Painting with Stroke Prediction

[Paper] [Paddle Implementation]

Update

We have optimized the serial inference procedure to achieve better rendering quality and faster speed.

Overview

This repository contains the official PaddlePaddle implementation of paper:

Paint Transformer: Feed Forward Neural Painting with Stroke Prediction,

Songhua Liu*, Tianwei Lin*, Dongliang He, Fu Li, Ruifeng Deng, Xin Li, Errui Ding, Hao Wang (* indicates equal contribution)

ICCV 2021 (Oral)

Prerequisites

Linux or macOS
Python 3.6+
PaddlePaddle 2.0+ and other dependencies (numpy, cv2, and other common python libs)
```
python -m pip install paddlepaddle-gpu
```

Getting Started

Clone this repository:

git clone https://github.com/wzmsltw/PaintTransformer
cd PaintTransformer

Download pretrained model from Google Drive and move it to inference directory:
```
mv [Download Directory]/paint_best.pdparams inference/
cd inference
```
Inference:
```
python inference.py
```
- Input image path, output path, and etc can be set in the main function.
- Notably, there is a flag serial as one parameter of the main function:
  - If serial is True, strokes would be rendered serially. The consumption of video memory will be low but it requires more time. Serial inference can achieve better rendering quality.
  - If serial is False, strokes would be rendered in parallel. The consumption of video memory will be high but it would be faster.
  - If animated results are required, serial must be True.
Train:
- You can send email to us for the training codes.

More Results

Input	Animated Output

App

Do not want to run the code? Try an App 一刻相册 downloaded from here!

Citation

If you find ideas or codes useful for your research, please cite:

@inproceedings{liu2021paint,
  title={Paint Transformer: Feed Forward Neural Painting with Stroke Prediction},
  author={Liu, Songhua and Lin, Tianwei and He, Dongliang and Li, Fu and Deng, Ruifeng and Li, Xin and Ding, Errui and Wang, Hao},
  booktitle={Proceedings of the IEEE International Conference on Computer Vision},
  year={2021}
}

Contact

For any question, please file an issue or contact

Songhua Liu: s[email protected]
Tianwei Lin: [email protected]

Official PaddlePaddle implementation of Paint Transformer

Related tags

Overview

Paint Transformer: Feed Forward Neural Painting with Stroke Prediction

Update

Overview

Prerequisites

Getting Started

More Results

App

Citation

Contact

Owner

TianweiLin

Deploying PyTorch Model to Production with FastAPI in CUDA-supported Docker

Fast and exact ILP-based solvers for the Minimum Flow Decomposition (MFD) problem, and variants of it.

Rethinking of Pedestrian Attribute Recognition: A Reliable Evaluation under Zero-Shot Pedestrian Identity Setting

Reference PyTorch implementation of "End-to-end optimized image compression with competition of prior distributions"

Semantic Segmentation in Pytorch. Network include: FCN、FCN_ResNet、SegNet、UNet、BiSeNet、BiSeNetV2、PSPNet、DeepLabv3_plus、 HRNet、DDRNet

Implementation of SE3-Transformers for Equivariant Self-Attention, in Pytorch.

Code for the ICCV 2021 paper "Pixel Difference Networks for Efficient Edge Detection" (Oral).

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python Single Object Tracking Evaluation

Optimized primitives for collective multi-GPU communication

Multi-Scale Geometric Consistency Guided Multi-View Stereo

Recursive Bayesian Networks

Codes for "CSDI: Conditional Score-based Diffusion Models for Probabilistic Time Series Imputation"

Experimental Python implementation of OpenVINO Inference Engine (very slow, limited functionality). All codes are written in Python. Easy to read and modify.

시각 장애인을 위한 스마트 지팡이에 활용될 딥러닝 모델 (DL Model Repo)

TensorFlow implementation of the algorithm in the paper "Decoupled Low-light Image Enhancement"

Read and write layered TIFF ImageSourceData and ImageResources tags

Running Google MoveNet Multipose Tracking models on OpenVINO.

The official codes for the ICCV2021 presentation "Uniformity in Heterogeneity: Diving Deep into Count Interval Partition for Crowd Counting"

AMTML-KD: Adaptive Multi-teacher Multi-level Knowledge Distillation