Single-step adversarial training (AT) has received wide attention as it proved to be both efficient and robust.

Last update: Sep 02, 2022

Related tags

Overview

Subspace Adversarial Training

Single-step adversarial training (AT) has received wide attention as it proved to be both efficient and robust. However, a serious problem of catastrophic overfitting exists, i.e., the robust accuracy against projected gradient descent (PGD) attack suddenly drops to 0% during the training. In this paper, we understand this problem from a novel perspective of optimization and firstly reveal the close link between the fast-growing gradient of each sample and overfitting, which can also be applied to understand the robust overfitting phenomenon in multi-step AT. To control the growth of the gradient during the training, we propose a new AT method, subspace adversarial training (Sub-AT), which constrains the AT in a carefully extracted subspace. It successfully resolves both two kinds of overfitting and hence significantly boosts the robustness. In subspace, we also allow single-step AT with larger steps and larger radius, which further improves the robustness performance. As a result, we achieve the state-of-the-art single-step AT performance: our pure single-step AT can reach over 51% robust accuracy against strong PGD-50 attack with radius 8/255 on CIFAR-10, even surpassing the standard multi-step PGD-10 AT with huge computational advantages.

Dependencies

Install required dependencies:

pip install -r requirements.txt

We also evaluate the robustness with Auto-Attack. It can be installed via following source code:

pip install git+https://github.com/fra31/auto-attack

How to run

We show sample usages in run.sh:

bash run.sh

For Tiny-ImageNet experiments, please prepare the dataset first under the path datasets/tiny-imagenet-200/.

For more detailed settings of different datasets, please refer to the supplementary material.

Single-step adversarial training (AT) has received wide attention as it proved to be both efficient and robust.

Related tags

Overview

Subspace Adversarial Training

Dependencies

How to run

Owner

Source code for Adaptively Calibrated Critic Estimates for Deep Reinforcement Learning

Official implementation of the NRNS paper: No RL, No Simulation: Learning to Navigate without Navigating

A Domain-Agnostic Benchmark for Self-Supervised Learning

ALBERT-pytorch-implementation - ALBERT pytorch implementation

Data and codes for ACL 2021 paper: Towards Emotional Support Dialog Systems

[ICCV'21] Neural Radiance Flow for 4D View Synthesis and Video Processing

Using contrastive learning and OpenAI's CLIP to find good embeddings for images with lossy transformations

Cache Requests in Deta Bases and Echo them with Deta Micros

Alfred-Restore-Iterm-Arrangement - An Alfred workflow to restore iTerm2 window Arrangements

This repository contains code and data for "On the Multimodal Person Verification Using Audio-Visual-Thermal Data"

Supervised Contrastive Learning for Product Matching

Boundary IoU API (Beta version)

yolov5目标检测模型的知识蒸馏（基于响应的蒸馏）

Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.

Generative Flow Networks for Discrete Probabilistic Modeling

A transformer model to predict pathogenic mutations

Molecular Sets (MOSES): A Benchmarking Platform for Molecular Generation Models

Planner_backend - Academic planner application designed for students and counselors.

A framework for annotating 3D meshes using the predictions of a 2D semantic segmentation model.

Invert and perturb GAN images for test-time ensembling