Official PyTorch implementation of "Splicing ViT Features for Semantic Appearance Transfer", presenting Splice.

Last update: Jan 06, 2023

Overview

Splicing ViT Features for Semantic Appearance Transfer [Project Page]

Splice is a method for semantic appearance transfer, as described in "Splicing ViT Features for Semantic Appearance Transfer" (arXiv:2201.00424).

Given two input images (a source structure image and a target appearance image), our method generates a new image in which the structure of the source image is preserved, while the visual appearance of the target image is transferred in a semantically aware manner. That is, objects in the structure image are "painted" with the visual appearance of semantically related objects in the appearance image. Our method leverages a self-supervised, pre-trained ViT model as an external semantic prior. This allows us to train our generator on only a single input image pair, without any additional information (e.g., segmentation or correspondences) and without adversarial training. As a result, our framework works across a variety of objects and scenes and can generate high-quality results at high resolution (e.g., HD).
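To make the idea of keeping structure while transferring appearance more concrete, here is a toy sketch in plain Python. It is not the repository's code. It assumes the ViT yields one feature vector per image patch plus one global vector per image, and it uses the cosine self-similarity of patch features as a stand-in for structure and the global vector as a stand-in for appearance. The function names and the exact form of both losses are hypothetical; see the paper for the actual formulation.

```python
import math


def cosine(u, v):
    """Cosine similarity between two feature vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def self_similarity(tokens):
    """Pairwise cosine similarity between all patch features of one image."""
    return [[cosine(a, b) for b in tokens] for a in tokens]


def structure_loss(out_tokens, src_tokens):
    """Hypothetical structure term: mean squared difference between the
    self-similarity matrices of the output and the source structure image."""
    s_out = self_similarity(out_tokens)
    s_src = self_similarity(src_tokens)
    n = len(s_src)
    total = sum((s_out[i][j] - s_src[i][j]) ** 2 for i in range(n) for j in range(n))
    return total / (n * n)


def appearance_loss(out_global, tgt_global):
    """Hypothetical appearance term: mean squared difference between the
    global feature of the output and that of the target appearance image."""
    return sum((a - b) ** 2 for a, b in zip(out_global, tgt_global)) / len(tgt_global)
```

Because the structure term compares only similarities between patches, rescaling every patch feature leaves it unchanged. In this toy setting, that is what lets appearance vary while the layout is held fixed.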

Getting Started

Installation

git clone https://github.com/omerbt/Splice.git
cd Splice
pip install -r requirements.txt

Run examples

Run the following command to start training:

python train.py --dataroot datasets/cows

Intermediate results will be saved to /out/output.png during optimization. How often they are saved is set by the save_epoch_freq flag in the configuration.
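As a rough illustration of how a save-frequency flag like save_epoch_freq usually behaves, the sketch below gates saving on the epoch counter. The helper name `should_save` and the modulo check are assumptions made for illustration; the repository's training loop may implement this differently.

```python
def should_save(epoch, save_epoch_freq):
    """Return True when an intermediate result would be written at this epoch.

    Assumption: saving happens every `save_epoch_freq` epochs, and a
    non-positive value disables intermediate saving altogether.
    """
    return save_epoch_freq > 0 and epoch % save_epoch_freq == 0


# With a hypothetical frequency of 5, epochs 1..10 would trigger two saves.
saved_epochs = [epoch for epoch in range(1, 11) if should_save(epoch, 5)]
```

A smaller value gives more frequent previews of the optimization at the cost of extra disk writes.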

Sample Results

Citation

@article{Splice2022,
    author = {Tumanyan, Narek
              and Bar-Tal, Omer
              and Bagon, Shai
              and Dekel, Tali
              },
    title = {Splicing ViT Features for Semantic Appearance Transfer}, 
    journal = {arXiv preprint arXiv:2201.00424},
    year  = {2022}
}

Owner

Omer Bar Tal
