Anderson Accelerated Deep Learning (AADL)

AADL is a Python package that implements Anderson acceleration (AA) to speed up the training of deep learning (DL) models with the PyTorch library.
AA is an extrapolation technique that can accelerate fixed-point iterations such as those arising from the iterative training of DL models. However, large volumes of data are typically processed in sequential random batches, which introduces stochastic oscillations into the fixed-point iteration that hinder AA. AADL implements a moving average that reduces these oscillations and yields a smoother sequence of gradient descent updates, which enables the use of AA. AADL automatically decides whether the moving average is needed by checking whether the relative standard deviation between consecutive stochastic gradient updates exceeds a user-defined tolerance.
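
The averaging criterion described above can be illustrated with a small sketch. This is not AADL's code: the function names `relative_std` and `select_update`, the use of update norms over a short window, and the tolerance value are all assumptions made for illustration, and the package's actual criterion may differ.

```python
import numpy as np


def relative_std(updates):
    """Relative standard deviation (std / mean) of the update norms in a window.

    Hypothetical helper: AADL's actual measure may be defined differently.
    """
    norms = np.array([np.linalg.norm(u) for u in updates])
    mean = norms.mean()
    return float(norms.std() / mean) if mean > 0 else 0.0


def select_update(updates, tolerance):
    """Return (update, averaged).

    If the window of recent updates is noisier than `tolerance`, return the
    window average; otherwise return the most recent update unchanged.
    """
    if relative_std(updates) > tolerance:
        return np.mean(updates, axis=0), True
    return updates[-1], False


# Updates of nearly constant size: no averaging is triggered.
steady = [np.array([1.0, 1.0]), np.array([1.01, 0.99]), np.array([0.99, 1.01])]
# Updates that oscillate strongly: the window average is used instead.
noisy = [np.array([1.0, 0.0]), np.array([-3.0, 0.0]), np.array([0.5, 0.0])]

print(select_update(steady, tolerance=0.1))
print(select_update(noisy, tolerance=0.1))
```

In this sketch the smoothed update replaces the raw one only when the oscillations are large, so a well-behaved sequence is passed on untouched.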

Requirements

Python 3.5 or greater
PyTorch (any version works)

Installation

AADL comes with a setuptools install script:

python3 setup.py install

Usage

import torch
import torch.nn
import torch.optim
import AADL

# Creation of the DL model (neural network)
class Model(torch.nn.Module):
	...

# Instantiate the model so that parameters() is called on an instance, not on the class
model = Model()

# Definition of the stochastic optimizer used to train the model
optimizer = torch.optim.SGD(model.parameters(), lr=1e-3, momentum=0.9, nesterov=True)

# Parameters for Anderson acceleration
relaxation = 0.5
wait_iterations = 0
history_depth = 10
store_each_nth = 10
frequency = store_each_nth
reg_acc = 0.0
safeguard = True
average = True

# Override the optimizer's step() method with the Anderson-accelerated version
AADL.accelerate(optimizer, "anderson", relaxation, wait_iterations, history_depth, store_each_nth, frequency, reg_acc, average)
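
To give a feel for what `relaxation` and `history_depth` control, here is a minimal sketch of Anderson acceleration applied to a small linear fixed-point problem with NumPy. It assumes these two parameters carry their usual meaning in the AA literature (mixing factor and number of stored iterate differences); the function `anderson_fixed_point` and the arguments `tol` and `max_iter` are hypothetical and are not part of AADL.

```python
import numpy as np


def anderson_fixed_point(g, x0, relaxation=0.5, history_depth=10,
                         tol=1e-8, max_iter=100):
    """Solve x = g(x) with Anderson acceleration (illustrative sketch only).

    Returns (x, iterations). With history_depth=0 this reduces to the plain
    relaxed iteration x <- x + relaxation * (g(x) - x).
    """
    x = np.asarray(x0, dtype=float)
    f = g(x) - x                      # residual of the fixed-point map
    xs, fs = [x], [f]                 # sliding window of iterates and residuals
    for k in range(1, max_iter + 1):
        if np.linalg.norm(f) < tol:
            return x, k - 1
        if len(xs) == 1:
            # No history yet: take a plain relaxed step.
            x_new = x + relaxation * f
        else:
            # Columns hold differences of consecutive iterates and residuals.
            dX = np.stack([xs[i + 1] - xs[i] for i in range(len(xs) - 1)], axis=1)
            dF = np.stack([fs[i + 1] - fs[i] for i in range(len(fs) - 1)], axis=1)
            # Least-squares coefficients that minimise ||f - dF @ gamma||.
            gamma, *_ = np.linalg.lstsq(dF, f, rcond=None)
            x_new = x + relaxation * f - (dX + relaxation * dF) @ gamma
        x = x_new
        f = g(x) - x
        xs.append(x)
        fs.append(f)
        # Keep at most history_depth differences (history_depth + 1 points).
        xs = xs[-(history_depth + 1):]
        fs = fs[-(history_depth + 1):]
    return x, max_iter


# Toy problem: x = a * x + b (elementwise), with exact solution b / (1 - a).
a = np.array([0.9, 0.5, 0.1])
b = np.ones(3)
g = lambda x: a * x + b

x_aa, n_aa = anderson_fixed_point(g, np.zeros(3), history_depth=10)
x_plain, n_plain = anderson_fixed_point(g, np.zeros(3), history_depth=0, max_iter=1000)
print("accelerated iterations:", n_aa)
print("plain iterations:", n_plain)
```

On this toy problem the accelerated run needs far fewer iterations than the plain relaxed one, because the slowest component (contraction factor 0.9) dominates the plain iteration. Stochastic training is much noisier than this deterministic example, which is why the smoothing described earlier matters there.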

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

License

BSD-3-Clause

Citations

"AADL: Anderson Accelerated Deep Learning", Copyright ID#: 81927550 https://doi.org/10.11578/dc.20210723.1

Owner

Oak Ridge National Laboratory
