Unofficial implementation of the ImageNet, CIFAR 10 and SVHN Augmentation Policies learned by AutoAugment using pillow

Last update: Jan 02, 2023

Related tags

Overview

AutoAugment - Learning Augmentation Policies from Data

Unofficial implementation of the ImageNet, CIFAR10 and SVHN Augmentation Policies learned by AutoAugment, described in this Google AI Blogpost.

Update July 13th, 2018: Wrote a Blogpost about AutoAugment and Double Transfer Learning.

Tested with Python 3.6. Needs pillow>=5.0.0

Example

from autoaugment import ImageNetPolicy
image = PIL.Image.open(path)
policy = ImageNetPolicy()
transformed = policy(image)

To see examples of all operations and magnitudes applied to images, take a look at AutoAugment_Exploration.ipynb.

Example as a PyTorch Transform - ImageNet

from autoaugment import ImageNetPolicy
data = ImageFolder(rootdir, transform=transforms.Compose(
                        [transforms.RandomResizedCrop(224), 
                         transforms.RandomHorizontalFlip(), ImageNetPolicy(), 
                         transforms.ToTensor(), transforms.Normalize(...)]))
loader = DataLoader(data, ...)

Example as a PyTorch Transform - CIFAR10

from autoaugment import CIFAR10Policy
data = ImageFolder(rootdir, transform=transforms.Compose(
                        [transforms.RandomCrop(32, padding=4, fill=128), # fill parameter needs torchvision installed from source
                         transforms.RandomHorizontalFlip(), CIFAR10Policy(), 
			 transforms.ToTensor(), 
                         Cutout(n_holes=1, length=16), # (https://github.com/uoguelph-mlrg/Cutout/blob/master/util/cutout.py)
                         transforms.Normalize(...)]))
loader = DataLoader(data, ...)

Example as a PyTorch Transform - SVHN

from autoaugment import SVHNPolicy
data = ImageFolder(rootdir, transform=transforms.Compose(
                        [SVHNPolicy(), 
			 transforms.ToTensor(), 
                         Cutout(n_holes=1, length=20), # (https://github.com/uoguelph-mlrg/Cutout/blob/master/util/cutout.py)
                         transforms.Normalize(...)]))
loader = DataLoader(data, ...)

Results with AutoAugment

Generalizable Data Augmentations

Finally, we show that policies found on one task can generalize well across different models and datasets. For example, the policy found on ImageNet leads to significant improvements on a variety of FGVC datasets. Even on datasets for which fine-tuning weights pre-trained on ImageNet does not help significantly [26], e.g. Stanford Cars [27] and FGVC Aircraft [28], training with the ImageNet policy reduces test set error by 1.16% and 1.76%, respectively. This result suggests that transferring data augmentation policies offers an alternative method for transfer learning.

Unofficial implementation of the ImageNet, CIFAR 10 and SVHN Augmentation Policies learned by AutoAugment using pillow

Related tags

Overview

AutoAugment - Learning Augmentation Policies from Data

Tested with Python 3.6. Needs pillow>=5.0.0

Example

Example as a PyTorch Transform - ImageNet

Example as a PyTorch Transform - CIFAR10

Example as a PyTorch Transform - SVHN

Results with AutoAugment

Generalizable Data Augmentations

CIFAR 10

CIFAR 100

ImageNet

SVHN

Fine Grained Visual Classification Datasets

Owner

Philip Popien

Code for layerwise detection of linguistic anomaly paper (ACL 2021)

This is the reference implementation for "Coresets via Bilevel Optimization for Continual Learning and Streaming"

The official repository for Deep Image Matting with Flexible Guidance Input

TCPNet - Temporal-attentive-Covariance-Pooling-Networks-for-Video-Recognition

Collision risk estimation using stochastic motion models

Official code for "Distributed Deep Learning in Open Collaborations" (NeurIPS 2021)

Supplemental Code for "ImpressionNet :A Multi view Approach to Predict Socio Facial Impressions"

UDP++ (ECCVW 2020 Oral), (Winner of COCO 2020 Keypoint Challenge).

🍷 Gracefully claim weekly free games and monthly content from Epic Store.

AOT (Associating Objects with Transformers) in PyTorch

YolactEdge: Real-time Instance Segmentation on the Edge

Baseline model for "GraspNet-1Billion: A Large-Scale Benchmark for General Object Grasping" (CVPR 2020)

Deep Q-learning for playing chrome dino game

Download files from DSpace systems (because for some reason DSpace won't let you)

VISNOTATE: An Opensource tool for Gaze-based Annotation of WSI Data

ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration

[MICCAI'20] AlignShift: Bridging the Gap of Imaging Thickness in 3D Anisotropic Volumes

PyTorch implementation of Tacotron speech synthesis model.

Training, generation, and analysis code for Learning Particle Physics by Example: Location-Aware Generative Adversarial Networks for Physics

Recovering Brain Structure Network Using Functional Connectivity