Understanding the Generalization Benefit of Model Invariance from a Data Perspective

Last update: Jan 15, 2022

Related tags

Overview

Understanding the Generalization Benefit of Model Invariance from a Data Perspective

This is the code for our NeurIPS2021 paper "Understanding the Generalization Benefit of Model Invariance from a Data Perspective". There are two major parts in our code: sample covering number estimation and generalization benefit evaluation.

Requirments

Python 3.8
PyTorch
torchvision
scikit-learn-extra
scipy
robustness package (already included in our code)

Our code is based on robustness package.

Dataset

CIFAR-10 Download and extract the data into /data/cifar10
R2N2 Download the ShapeNet rendered images and put the data into /data/r2n2

The randomly sampled R2N2 images used for computing sample covering numbers and indices of examples for different sample sizes could be found here.

Estimation of sample covering numbers

To estimate the sample covering numbers of different data transformations, run the following script in /scn.

CUDA_VISIBLE_DEVICES=0 python run_scn.py  --epsilon 3 --transformation crop --cover_number_method fast --data-path /path/to/dataset

Note that the input is a N x C x H x W tensor where N is sample size.

Evaluation of generalization benefit

To train the model with data augmentation method, run the following script in /learn_invariance for R2N2 dataset

CUDA_VISIBLE_DEVICES=0 python main.py \
    --dataset r2n2 \
    --data ../data/2n2/ShapeNetRendering \
    --metainfo-path ../data/r2n2/metainfo_all.json \
    --transforms view  \
    --inv-method aug \
    --out-dir /path/to/out_dir \
    --arch resnet18 --epoch 110 --lr 1e-2 --step-lr 50 \
    --workers 30 --batch-size 128 --exp-name view

or the following script for CIFAR-10 dataset

CUDA_VISIBLE_DEVICES=0 python main.py \
    --dataset cifar \
    --data ../data/cifar10 \
    --n-per-class all \
    --transforms crop  \
    --inv-method aug \
    --out-dir /path/to/out_dir \
    --arch resnet18 --epoch 110 --lr 1e-2 --step-lr 50 \
    --workers 30 --batch-size 128 --exp-name crop

By setting --transforms to be one of {none, flip, crop, rotate, view}, the specific transformation will be considered.

To train the model with regularization method, run the following script. Currently, the code only support 3d-view transformation on R2N2 dataset.

CUDA_VISIBLE_DEVICES=0 python main.py \
    --dataset r2n2 \
    --data ../data/r2n2/ShapeNetRendering \
    --metainfo-path ../data/r2n2/metainfo_all.json \
    --transforms view  \
    --inv-method reg \
    --inv-method-beta 1 \
    --out-dir /path/to/out_dir \
    --arch resnet18 --epoch 110 --lr 1e-2 --step-lr 50 \
    --workers 30 --batch-size 128 --exp-name reg_view

To evaluate the model with invariance loss and worst-case consistency accuracy, run the following script.

CUDA_VISIBLE_DEVICES=0 python main.py  \
    --dataset r2n2 \
    --data ../data/r2n2/ShapeNetRendering \
    --metainfo-path ../data/r2n2/metainfo_all.json \
    --inv-method reg \
    --arch resnet18 \
    --resume /path/to/checkpoint.pt.best \
    --eval-only 1 \
    --transforms view  \
    --adv-eval 0 \
    --batch-size 2  \
    --no-store

Note that to have the worst-case consistency accuracy we need to load 24 view images in R2N2RenderingsTorch class in dataset_3d.py.

Understanding the Generalization Benefit of Model Invariance from a Data Perspective

Related tags

Overview

Understanding the Generalization Benefit of Model Invariance from a Data Perspective

Requirments

Dataset

Estimation of sample covering numbers

Evaluation of generalization benefit

Owner

ACAV100M: Automatic Curation of Large-Scale Datasets for Audio-Visual Video Representation Learning. In ICCV, 2021.

FinEAS: Financial Embedding Analysis of Sentiment 📈

Implementation of a Transformer, but completely in Triton

Jittor implementation of PCT:Point Cloud Transformer

OneShot Learning-based hotword detection.

DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editing

Anonymous implementation of KSL

Torchlight2 lan game server tool - A message forwarding tool for Torchlight 2 lan game

Wandb-predictions - WANDB Predictions With Python

Virtual Dance Reality Stage: a feature that offers you to share a stage with another user virtually

DumpSMBShare - A script to dump files and folders remotely from a Windows SMB share

Python library for tracking human heads with FLAME (a 3D morphable head model)

Official implementation of the paper "AAVAE: Augmentation-AugmentedVariational Autoencoders"

Tensorflow implementation for "Improved Transformer for High-Resolution GANs" (NeurIPS 2021).

Semi-supervised Video Deraining with Dynamical Rain Generator (CVPR, 2021, Pytorch)

High frequency AI based algorithmic trading module.

Neural Logic Inductive Learning

CellRank's reproducibility repository.

Ensembling Off-the-shelf Models for GAN Training

Do Smart Glasses Dream of Sentimental Visions? Deep Emotionship Analysis for Eyewear Devices