Experiments for distributed optimization algorithms

Last update: Dec 04, 2022

Overview

Network-Distributed Algorithm Experiments

This repository contains a set of optimization algorithms and objective functions, and all code needed to reproduce experiments in:

"DESTRESS: Computation-Optimal and Communication-Efficient Decentralized Nonconvex Finite-Sum Optimization" [PDF]. (code is in this file [link])
"Communication-Efficient Distributed Optimization in Networks with Gradient Tracking and Variance Reduction" [PDF]. (code is in the previous version of this repo [link])

Due to the random data generation procedure, the resulting graphs may differ slightly from those that appear in the paper, but the conclusions remain the same.

If you find this code useful, please cite our papers:

@article{li2021destress,
  title={DESTRESS: Computation-Optimal and Communication-Efficient Decentralized Nonconvex Finite-Sum Optimization},
  author={Li, Boyue and Li, Zhize and Chi, Yuejie},
  journal={arXiv preprint arXiv:2110.01165},
  year={2021}
}

@article{li2020communication,
  title={Communication-Efficient Distributed Optimization in Networks with Gradient Tracking and Variance Reduction},
  author={Li, Boyue and Cen, Shicong and Chen, Yuxin and Chi, Yuejie},
  journal={Journal of Machine Learning Research},
  volume={21},
  pages={1--51},
  year={2020}
}

Implemented objective functions

The gradient implementations of all objective functions are checked numerically.
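
A numerical check of this kind usually compares an analytic gradient against central finite differences. The sketch below illustrates the idea only and is not the repository's own checker: `numerical_grad` and `check_grad` are hypothetical names, and the step size and tolerance are assumed values.

```python
def numerical_grad(f, w, eps=1e-6):
    """Approximate the gradient of f at w with central finite differences."""
    grad = []
    for j in range(len(w)):
        w_plus = list(w)
        w_minus = list(w)
        w_plus[j] += eps
        w_minus[j] -= eps
        grad.append((f(w_plus) - f(w_minus)) / (2 * eps))
    return grad


def check_grad(f, grad_f, w, tol=1e-5):
    """Return True if the analytic gradient matches the numerical one within tol."""
    numeric = numerical_grad(f, w)
    analytic = grad_f(w)
    return max(abs(a - n) for a, n in zip(analytic, numeric)) < tol


# Toy example: f(w) = sum_j w_j^2 has gradient 2 * w.
f = lambda w: sum(x * x for x in w)
grad_f = lambda w: [2 * x for x in w]
ok = check_grad(f, grad_f, [1.0, -2.0, 0.5])
```

The same helper can be pointed at any objective that exposes a loss and a gradient function taking a list of floats.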

Linear regression

Linear regression with randomly generated data. The objective function is $f(w) = \frac{1}{N} \sum_i (y_i - x_i^\top w)^2$.
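
As an illustration, the objective above and its gradient, $\nabla f(w) = -\frac{2}{N} \sum_i (y_i - x_i^\top w) x_i$, can be written in a few lines. This is a minimal sketch using only the standard library; the function names and the noise-free data generation are assumptions made for the example, not the repository's code.

```python
import random


def dot(u, v):
    """Inner product of two equal-length lists."""
    return sum(a * b for a, b in zip(u, v))


def linreg_loss(w, X, y):
    """Mean squared residual f(w) = (1/N) * sum_i (y_i - x_i^T w)^2."""
    return sum((yi - dot(xi, w)) ** 2 for xi, yi in zip(X, y)) / len(y)


def linreg_grad(w, X, y):
    """Gradient of f: -(2/N) * sum_i (y_i - x_i^T w) * x_i."""
    n = len(y)
    grad = [0.0] * len(w)
    for xi, yi in zip(X, y):
        residual = yi - dot(xi, w)
        for j, xij in enumerate(xi):
            grad[j] -= 2.0 * residual * xij / n
    return grad


# Hypothetical noise-free synthetic data: y_i = x_i^T w_true.
rng = random.Random(0)
w_true = [1.0, -2.0, 0.5]
X = [[rng.gauss(0.0, 1.0) for _ in w_true] for _ in range(50)]
y = [dot(xi, w_true) for xi in X]

loss_at_truth = linreg_loss(w_true, X, y)  # zero, since the data are noise-free
loss_at_zero = linreg_loss([0.0] * 3, X, y)
```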

Logistic regression

Logistic regression with $\ell_2$ or nonconvex regularization, using randomly generated data, the Gisette dataset, or datasets from libsvmtools. The objective function is $$ f(w) = - \frac{1}{N} \Big(\sum_i y_i \log \frac{1}{1 + \exp(w^\top x_i)} + (1 - y_i) \log \frac{\exp(w^\top x_i)}{1 + \exp(w^\top x_i)} \Big) + \frac{\lambda}{2} \| w \|_2^2 + \alpha \sum_j \frac{w_j^2}{1 + w_j^2} $$
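
To make the formula concrete, the sketch below evaluates this objective and its gradient with the label convention exactly as written above, assuming $y_i \in \{0, 1\}$. Writing $z_i = w^\top x_i$, each data term simplifies to $\log(1 + e^{z_i}) - (1 - y_i) z_i$, which is what the code computes. The function names are hypothetical and the tiny dataset is made up for the example.

```python
import math


def softplus(z):
    """Numerically stable log(1 + exp(z))."""
    return max(z, 0.0) + math.log1p(math.exp(-abs(z)))


def sigmoid(z):
    """Numerically stable 1 / (1 + exp(-z))."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def logreg_loss(w, X, y, lam=0.0, alpha=0.0):
    """Data term plus (lam/2)*||w||^2 plus alpha * sum_j w_j^2 / (1 + w_j^2)."""
    data = 0.0
    for xi, yi in zip(X, y):
        z = sum(a * b for a, b in zip(xi, w))
        data += softplus(z) - (1.0 - yi) * z
    l2 = 0.5 * lam * sum(wj * wj for wj in w)
    nonconvex = alpha * sum(wj * wj / (1.0 + wj * wj) for wj in w)
    return data / len(y) + l2 + nonconvex


def logreg_grad(w, X, y, lam=0.0, alpha=0.0):
    """Gradient of logreg_loss with respect to w."""
    n = len(y)
    # Regularizer gradients: lam * w_j and 2 * alpha * w_j / (1 + w_j^2)^2.
    grad = [lam * wj + 2.0 * alpha * wj / (1.0 + wj * wj) ** 2 for wj in w]
    for xi, yi in zip(X, y):
        z = sum(a * b for a, b in zip(xi, w))
        coeff = (sigmoid(z) - (1.0 - yi)) / n
        for j, xij in enumerate(xi):
            grad[j] += coeff * xij
    return grad


# Made-up data for illustration only.
X = [[1.0, 2.0], [-1.0, 0.5], [0.3, -0.7]]
y = [1.0, 0.0, 1.0]
loss_at_zero = logreg_loss([0.0, 0.0], X, y)  # every data term is log(2) at w = 0
```

Setting `alpha=0` gives the convex $\ell_2$-regularized problem; a positive `alpha` adds the nonconvex penalty.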

One-hidden-layer fully-connected neural network

One-hidden-layer fully-connected neural network with softmax loss on the MNIST dataset.
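
For orientation, the loss of such a network on a single sample might look like the sketch below. The hidden activation (`tanh`), the hidden width, and the function name are assumptions made for the example; the text above only specifies one hidden layer, full connectivity, and a softmax loss.

```python
import math


def nn_softmax_loss(x, label, W1, b1, W2, b2):
    """Softmax cross-entropy loss of a one-hidden-layer network on one sample."""
    # Hidden layer; tanh is an assumed activation, not one stated in the text.
    hidden = [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
              for row, b in zip(W1, b1)]
    # Output layer produces one logit per class.
    logits = [sum(w * h for w, h in zip(row, hidden)) + b
              for row, b in zip(W2, b2)]
    # Log-sum-exp with the maximum subtracted for numerical stability.
    top = max(logits)
    log_norm = top + math.log(sum(math.exp(v - top) for v in logits))
    return log_norm - logits[label]


# MNIST-like shapes: 784 inputs and 10 classes; 16 hidden units is arbitrary.
n_in, n_hidden, n_out = 784, 16, 10
W1 = [[0.0] * n_in for _ in range(n_hidden)]
b1 = [0.0] * n_hidden
W2 = [[0.0] * n_hidden for _ in range(n_out)]
b2 = [0.0] * n_out
loss = nn_softmax_loss([0.5] * n_in, 3, W1, b1, W2, b2)  # log(10) for zero weights
```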

Implemented optimization algorithms

Centralized optimization algorithms

Gradient descent
Stochastic gradient descent
Nesterov's accelerated gradient descent
SVRG
SARAH
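
As one example from this list, the core SVRG loop can be sketched on the least-squares objective: each epoch computes a full gradient at a snapshot point, then takes stochastic steps that use the snapshot to reduce variance. This is a minimal illustration with assumed step size, epoch count, and toy data, and it does not reproduce the repository's implementation or interface.

```python
import random


def svrg_least_squares(X, y, step, epochs, inner_steps, seed=0):
    """Run SVRG on f(w) = (1/N) * sum_i (y_i - x_i^T w)^2, starting from w = 0."""
    rng = random.Random(seed)
    n, d = len(X), len(X[0])

    def grad_i(w, i):
        # Gradient of the i-th component (y_i - x_i^T w)^2.
        residual = y[i] - sum(a * b for a, b in zip(X[i], w))
        return [-2.0 * residual * a for a in X[i]]

    w = [0.0] * d
    for _ in range(epochs):
        snapshot = list(w)
        # Full gradient at the snapshot, computed once per epoch.
        full = [0.0] * d
        for i in range(n):
            for j, g in enumerate(grad_i(snapshot, i)):
                full[j] += g / n
        for _ in range(inner_steps):
            i = rng.randrange(n)
            g_w = grad_i(w, i)
            g_snap = grad_i(snapshot, i)
            # Variance-reduced direction: grad_i(w) - grad_i(snapshot) + full gradient.
            w = [wj - step * (a - b + c)
                 for wj, a, b, c in zip(w, g_w, g_snap, full)]
    return w


# Toy noise-free problem with known solution w_true.
X = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [1.0, -1.0]]
w_true = [2.0, -1.0]
y = [sum(a * b for a, b in zip(xi, w_true)) for xi in X]
w_hat = svrg_least_squares(X, y, step=0.02, epochs=30, inner_steps=100)
```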

Distributed optimization algorithms (i.e., with a parameter server)

ADMM
DANE

Decentralized optimization algorithms

Decentralized gradient descent
Decentralized stochastic gradient descent
Decentralized gradient descent with gradient tracking
EXTRA
NIDS
Network-DANE/SARAH/SVRG
GT-SARAH
DESTRESS
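
To illustrate the decentralized setting, the sketch below runs decentralized gradient descent with gradient tracking on a ring of agents. Each agent mixes its iterate with its neighbors' iterates and maintains a running estimate of the average gradient. The scalar quadratic local objectives, the ring mixing weights, and all function names are assumptions chosen to keep the example small; they are not taken from the repository.

```python
def ring_mixing_matrix(n):
    """Symmetric, doubly stochastic weights: 1/2 on self, 1/4 on each ring neighbor."""
    W = [[0.0] * n for _ in range(n)]
    for i in range(n):
        W[i][i] = 0.5
        W[i][(i - 1) % n] += 0.25
        W[i][(i + 1) % n] += 0.25
    return W


def mix(W, v):
    """One round of communication: every agent averages its neighbors' values."""
    n = len(v)
    return [sum(W[i][j] * v[j] for j in range(n)) for i in range(n)]


def gradient_tracking(grads, W, x0, step, iters):
    """Gradient tracking with one scalar variable per agent."""
    n = len(x0)
    x = list(x0)
    g = [grads[i](x[i]) for i in range(n)]
    s = list(g)  # tracking variable, initialized to the local gradients
    for _ in range(iters):
        x_new = [xi - step * si for xi, si in zip(mix(W, x), s)]
        g_new = [grads[i](x_new[i]) for i in range(n)]
        # Mix the trackers, then correct them with the change in local gradients.
        s = [ms + gn - go for ms, gn, go in zip(mix(W, s), g_new, g)]
        x, g = x_new, g_new
    return x


# Agent i holds f_i(x) = 0.5 * (x - b_i)^2; the global minimizer is the mean of b.
targets = [1.0, 2.0, 3.0, 4.0, 5.0]
grads = [lambda x, b=b: x - b for b in targets]
W = ring_mixing_matrix(len(targets))
x_final = gradient_tracking(grads, W, [0.0] * len(targets), step=0.1, iters=300)
```

Plain decentralized gradient descent corresponds to replacing the tracker `s` with the local gradient `g`; with a constant step size it generally settles only in a neighborhood of the minimizer, which is the gap that gradient tracking closes.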

Owner

Boyue Li
