Disturbing Target Values for Neural Network regularization: attacking the loss layer to prevent overfitting

Last update: Apr 24, 2022

Related tags

Overview

Disturbing Target Values for Neural Network regularization: attacking the loss layer to prevent overfitting

1. Classification Task

PyTorch implementation of DisturbLabel: Regularizing CNN on the Loss Layer [CVPR 2016] extended with Directional DisturbLabel method.

This classification code is built on top of https://github.com/amirhfarzaneh/disturblabel-pytorch/blob/master/README.md project and utilizes implementation from ResNet 18 from https://github.com/huyvnphan/PyTorch_CIFAR10

Directional DisturbLabel

  if args.mode == 'ddl' or args.mode == 'ddldr':
      out = F.softmax(output, dim=1)
      norm = torch.norm(out, dim=1)
      out = out / norm[:, None]
      idx = []
      for i in range(len(out)):
          if out[i,target[i]] > .5:
              idx.append(i)
              
      if len(idx) > 0:
          target[idx] = disturb(target[idx]).to(device)

Usage

python main_ddl.py --mode=dl --alpha=20

Most important arguments

--dataset - which data to use

Possible values:

value	dataset
MNIST	MNIST
FMNIST	Fashion MNIST
CIFAR10	CIFAR-10
CIFAR100	CIFAR-100
ART	Art Images: Drawing/Painting/Sculptures/Engravings
INTEL	Intel Image Classification

Default: MNIST

-- mode - regularization method applied

Possible values:

value	method
noreg	Without any regularization
dl	Vanilla DistrubLabel
ddl	Directional DisturbLabel
dropout	Dropout
dldr	DistrubLabel+Dropout
ddldl	Directional DL+Dropout

Default: ddl

--alpha - alpha for vanilla Distrub label and Directional DisturbLabel

Possible values: int from 0 to 100. Default: 20

--epochs - number of training epochs

Default: 100

2. Regression Task

DisturbValue

def noise_generator(x, alpha):
    noise = torch.normal(0, 1e-8, size=(len(x), 1))
    noise[torch.randint(0, len(x), (int(len(x)*(1-alpha)),))] = 0

    return noise

DisturbError

def disturberror(outputs, values):
    epsilon = 1e-8
    e = values - outputs
    for i in range(len(e)):
        if (e[i] < epsilon) & (e[i] >= 0):
            values[i] = values[i] + e[i] / 4
        elif (e[i] > -epsilon) & (e[i] < 0):
            values[i] = values[i] - e[i] / 4

    return values

Datasets

Boston: 506 instances, 13 features
Bike Sharing: 731 instances, 13 features
Air Quality(AQ): 9357 instances, 10 features
make_regression(MR): 5000 instances, 30 features (random sample for regression)
Housing Price - Kaggle(HP): 1460 instances, 81 features
Student Performance (SP): 649 instances, 13 features (20 - categorical were dropped)
Superconductivity Dataset (SD): 21263 instances, 81 features
Communities & Crime (CC): 1994 instances, 100 features
Energy Prediction (EP): 19735 instancies, 27 features

Experiment Setting

Model: MLP which has 3 hidden layers

Result: Averaged over 20 runs

Hyperparameters: Using grid search options

Usage

python main_new.py --de y --dataset "bike" --dv_annealing y --epoch 100 --T 80
python main_new.py --de y --dv y --dataset "bike" -epoch 100
python main_new.py --de y --l2 y --dataset "air" -epoch 100
python main_new.py --dv y --dv_annealing y --dataset "air" -epoch 100 #for annealing setting dv should be "y"

--dataset: 'bike', 'air', 'boston', 'housing', 'make_sklearn', 'superconduct', 'energy', 'crime', 'students'
--dropout, --dv(disturbvalue), --de(disturberror), --l2, --dv_annealing: (string) y / n
--lr: (float)
--batch_size, --epoch, --T(cos annealing T): (int)
-- default dv_annealing: alpha_min = 0.05, alpha_max = 0.12, T_i = 80

Disturbing Target Values for Neural Network regularization: attacking the loss layer to prevent overfitting

Related tags

Overview

Disturbing Target Values for Neural Network regularization: attacking the loss layer to prevent overfitting

1. Classification Task

Directional DisturbLabel

Usage

Most important arguments

2. Regression Task

DisturbValue

DisturbError

Datasets

Experiment Setting

Model: MLP which has 3 hidden layers

Result: Averaged over 20 runs

Hyperparameters: Using grid search options

Usage

Owner

Yongho Kim

Replication Package for AequeVox:Automated Fariness Testing for Speech Recognition Systems

Official Implementation for Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation

Self Governing Neural Networks (SGNN): the Projection Layer

Locally Constrained Self-Attentive Sequential Recommendation

Code for Reciprocal Adversarial Learning for Brain Tumor Segmentation: A Solution to BraTS Challenge 2021 Segmentation Task

SwinIR: Image Restoration Using Swin Transformer

Collect super-resolution related papers, data, repositories

A implemetation of the LRCN in mxnet

This repository includes code of my study about Asynchronous in Frequency domain of GAN images.

PenguinSpeciesPredictionML - Basic model to predict Penguin species based on beak size and sex.

MM1 and MMC Queue Simulation using python - Results and parameters in excel and csv files

code for Grapadora research paper experimentation

This is the repo for Uncertainty Quantification 360 Toolkit.

Intel® Neural Compressor is an open-source Python library running on Intel CPUs and GPUs

Simple image captioning model - CLIP prefix captioning.

sequitur is a library that lets you create and train an autoencoder for sequential data in just two lines of code

MetaBalance: Improving Multi-Task Recommendations via Adapting Gradient Magnitudes of Auxiliary Tasks

Edge-aware Guidance Fusion Network for RGB-Thermal Scene Parsing

Vertical Federated Principal Component Analysis and Its Kernel Extension on Feature-wise Distributed Data based on Pytorch Framework

[NeurIPS 2021] A weak-shot object detection approach by transferring semantic similarity and mask prior.