This repo is the official implementation of "L2ight: Enabling On-Chip Learning for Optical Neural Networks via Efficient in-situ Subspace Optimization".

Last update: Jul 14, 2022

Related tags

Deep Learning L2ight

Overview

L2ight

By Jiaqi Gu, Hanqing Zhu, Chenghao Feng, Zixuan Jiang, Ray T. Chen and David Z. Pan.

This repo is the official implementation of "L2ight: Enabling On-Chip Learning for Optical Neural Networks via Efficient in-situ Subspace Optimization".

Introduction

L2ight is a closed-loop ONN on-chip learning framework to enable scalable ONN mapping and efficient in-situ learning. L2ight adopts a three-stage learning flow that first calibrates the complicated photonic circuit states under challenging physical constraints, then performs photonic core mapping via combined analytical solving and zeroth-order optimization. A subspace learning procedure with multi-level sparsity is integrated into L2ight to enable in-situ gradient evaluation and fast adaptation, unleashing the power of optics for real on-chip intelligence. L2ight outperforms prior ONN training protocols with 3-order-of-magnitude higher scalability and over 30X better efficiency, when benchmarked on various models and learning tasks. This synergistic framework is the first scalable on-chip learning solution that pushes this emerging field from intractable to scalable and further to efficient for next-generation self-learnable photonic neural chips.

Dependencies

Python >= 3.6
pyutils >= 0.0.1. See pyutils for installation.
pytorch-onn >= 0.0.1. See pytorch-onn for installation.
Python libraries listed in requirements.txt
NVIDIA GPUs and CUDA >= 10.2

Structures

core/
- models/
  - layers/
    - custom_conv2d and custom_linear layers
    - utils.py: sampler and profiler
  - sparse_bp_*.py: model definition
  - sparse_bp_base.py: base model definition; identity calibration and mapping codes.
- optimizer/: mixedtrain and flops optimizers
- builder.py: build training utilities
script/: contains experiment scripts
train_pretrain.py, train_map.py, train_learn.py, train_zo_learn.py: training logic
compare_gradient.py: compare approximated gradients with true gradients for ablation

Usage

Pretrain model.
> python3 train_pretrain.py config/cifar10/vgg8/pretrain.yml
Identity calibration and parallel mapping. Please set your hyperparameters in CONFIG=config/cifar10/vgg8/pm/pm.yml and run
> python3 train_map.py CONFIG --checkpoint.restore_checkpoint=path/to/your/pretrained/checkpoint
Subspace learning with multi-level sampling. Please set your hyperparameters in CONFIG=config/cifar10/vgg8/ds/learn.yml and run
> python3 train_learn.py CONFIG --checkpoint.restore_chekcpoint=path/to/your/mapped/checkpoint --checkpoint.resume=1
All scripts for experiments are in ./script. For example, to run subspace learning with feedback sampling, column sampling, and data sampling, you can write proper task setting in SCRIPT=script/vgg8/train_ds_script.py and run
> python3 SCRIPT
Comparison experiments with RAD [ICLR 2021] and SWAT-U [NeurIPS 2020]. Run with the SCRIPT=script/vgg8/train_rad_script.py and script/vgg8/train_swat_script.py,
> python3 SCRIPT
Comparison with FLOPS [DAC 2020] and MixedTrn [AAAI 2021]. Run with the METHOD=mixedtrain or flops,
> python3 train_zo_learn.py config/mnist/cnn3/METHOD/learn.yml

Citing L2ight

@inproceedings{gu2021L2ight,
  title={L2ight: Enabling On-Chip Learning for Optical Neural Networks via Efficient in-situ Subspace Optimization},
  author={Jiaqi Gu and Hanqing Zhu and Chenghao Feng and Zixuan Jiang and Ray T. Chen and David Z. Pan},
  journal={Conference on Neural Information Processing Systems (NeurIPS)},
  year={2021}
}

This repo is the official implementation of "L2ight: Enabling On-Chip Learning for Optical Neural Networks via Efficient in-situ Subspace Optimization".

Related tags

Overview

L2ight

Introduction

Dependencies

Structures

Usage

Citing L2ight

Owner

Jiaqi Gu

Repo for code associated with Modeling the Mitral Valve.

Hypersearch weight debugging and losses tutorial

Source code for "Progressive Transformers for End-to-End Sign Language Production" (ECCV 2020)

A multi-functional library for full-stack Deep Learning. Simplifies Model Building, API development, and Model Deployment.

Datasets, tools, and benchmarks for representation learning of code.

A state-of-the-art semi-supervised method for image recognition

RepMLP: Re-parameterizing Convolutions into Fully-connected Layers for Image Recognition

The openspoor package is intended to allow easy transformation between different geographical and topological systems commonly used in Dutch Railway

An open-source, low-cost, image-based weed detection device for fallow scenarios.

Official implementation of the paper ``Unifying Nonlocal Blocks for Neural Networks'' (ICCV'21)

Official Pytorch Implementation of Unsupervised Image Denoising with Frequency Domain Knowledge

A PyTorch implementation of Implicit Q-Learning

StyleSwin: Transformer-based GAN for High-resolution Image Generation

End-to-end Temporal Action Detection with Transformer. [Under review]

Hierarchical Few-Shot Generative Models

Fast, modular reference implementation and easy training of Semantic Segmentation algorithms in PyTorch.

Simple converter for deploying Stable-Baselines3 model to TFLite and/or Coral

Empower Sequence Labeling with Task-Aware Language Model

This program will stylize your photos with fast neural style transfer.

Robotic Process Automation in Windows and Linux by using Driagrams.net BPMN diagrams.