PyTorch Implementation of the paper Single Image Texture Translation for Data Augmentation

Last update: Jan 05, 2023

Related tags

Text Data & NLP SITT

Overview

SITT

The repo contains official PyTorch Implementation of the paper Single Image Texture Translation for Data Augmentation.

Authors:

Overview

Recent advances in image synthesis enables one to translate images by learning the mapping between a source domain and a target domain. Existing methods tend to learn the distributions by training a model on a variety of datasets, with results evaluated largely in a subjective manner. Relatively few works in this area, however, study the potential use of semantic image translation methods for image recognition tasks. In this paper, we explore the use of Single Image Texture Translation (SITT) for data augmentation. We first propose a lightweight model for translating texture to images based on a single input of source texture, allowing for fast training and testing. Based on SITT, we then explore the use of augmented data in long-tailed and few-shot image classification tasks. We find the proposed method is capable of translating input data into a target domain, leading to consistent improved image recognition performance. Finally, we examine how SITT and related image translation methods can provide a basis for a data-efficient, augmentation engineering approach to model training.

Usage

Environment

CUDA 10.1, pytorch 1.3.1

Dataset Preparation

	dataset	url
0	SITT leaves images from Plant Pathology 2020	download

Running

bash run.sh

More will be updated

If you find this repo useful, please cite:

@article{li2021single,
  title={Single Image Texture Translation for Data Augmentation},
  author={Li, Boyi and Cui, Yin and Lin, Tsung-Yi and Belongie, Serge},
  journal={arXiv preprint arXiv:2106.13804},
  year={2021}
}

PyTorch Implementation of the paper Single Image Texture Translation for Data Augmentation

Related tags

Overview

SITT

Authors:

Overview

Usage

Environment

Dataset Preparation

Running

More will be updated

Owner

Boyi Li

Simple and efficient RevNet-Library with DeepSpeed support

Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form.

ZUNIT - Toward Zero-Shot Unsupervised Image-to-Image Translation

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

a CTF web challenge about making screenshots

A deep learning-based translation library built on Huggingface transformers

HiFi DeepVariant + WhatsHap workflowHiFi DeepVariant + WhatsHap workflow

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

T‘rex Park is a Youzan sponsored project. Offering Chinese NLP and image models pretrained from E-commerce datasets

a chinese segment base on crf

The aim of this task is to predict someone's English proficiency based on a text input.

Revisiting Pre-trained Models for Chinese Natural Language Processing (Findings of EMNLP 2020)

AI_Assistant - This is a Python based Voice Assistant.

This is Assignment1 code for the Web Data Processing System.

Chinese NewsTitle Generation Project by GPT2.带有超级详细注释的中文GPT2新闻标题生成项目。

Creating a Feed of MISP Events from ThreatFox (by abuse.ch)

Simple multilingual lemmatizer for Python, especially useful for speed and efficiency

SurvTRACE: Transformers for Survival Analysis with Competing Events

Idea is to build a model which will take keywords as inputs and generate sentences as outputs.

The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.