Born-Infeld (BI) for AI: Energy-Conserving Descent (ECD) for Optimization

This repository contains the code for the BBI optimizer, introduced in the paper Born-Infeld (BI) for AI: Energy-Conserving Descent (ECD) for Optimization. 2201.11137. It is implemented using Pytorch.

The repository also includes the code needed to reproduce all the experiments presented in the paper. In particular:

The BBI optimizer is implemented in the file inflation.py.
The jupyter notebooks with the synthetic experiments are in the folder synthetic. All the notebooks already include the output, and text files with results are also included in the folder. In particular
- The notebook ackley.ipynb can be used to reproduce the results in Sec. 4.1.
- The notebook zakharov.ipynb can be used to reproduce the results in Sec. 4.2.
- The notebook multi_basin.ipynb can be used to reproduce the results in Sec. 4.3.
The ML benchmarks described in Sec. 4.5 can be found in the folders CIFAR and MNIST. The notebooks already include some results that can be inspected, but not all the statistics that builds up the results in Table 2. In particular:
- CIFAR : The notebook CIFAR-notebook.ipynb uses hyperopt to estimate the best hyperparameters for each optimizer and then runs a long run with the best estimated hyperparamers. The results can be analyzed with the notebook analysis-cifar.ipynb, which can also be used to generate more runs with the best hyperparameters to gather more statistics. The subfolder results already includes some runs that can be inspected.
- MNIST: The notebooks mnist_scan_BBI.ipynb and mnist_scan_SGD.ipynb perform a grid scan using BBI and SGD, respectively and gather some small statistics. All the results are within the notebooks themselves.
The PDE experiments can be run by running the script script-PDE.sh as
```
bash script-PDE.sh
```
This will solve the PDE outlined in Sec. 4.4 and App. C multiple times with the same initialization. The hyperparameters are also kept fixed and can be obtained from the script itself. In particular:
- feature 1 means that an L2 regularization is added to the loss.
- seed specifies the seed, which fixes the initialization of the network. The difference between the different runs then is only due to the random bounces, which are not affected by this choice of the seed.
The folder results already includes some runs. The runs performed in this way are not noisy, i.e. the set of points sampled from the domain is kept fixed. To randomly change the points every "epoch" (1000 iterations), edit the file experiments/PDE_PoissonD.py by changing line 134 to self.update_points = True.

The code has been tested with Python 3.9, Pytorch 1.10, hyperopt 0.2.5. We ran the synthetic experiments and MNIST on a six-core i7-9850H CPU with 16 GB of RAM, while we ran the CIFAR and PDE experiments on a pair of GPUs. We tested both on a pair of NVIDIA GeForce RTX 2080 Ti and on a pair of NVIDIA Tesla V100-SXM2-16GB GPUs, coupled with 32 GB of RAM and AMD EPYC 7502P CPUs.

The Resnet-18 code (in experiments/models) and the utils.py helper functions are adapted from https://github.com/kuangliu/pytorch-cifar (MIT License).

Born-Infeld (BI) for AI: Energy-Conserving Descent (ECD) for Optimization

Related tags

Overview

Born-Infeld (BI) for AI: Energy-Conserving Descent (ECD) for Optimization

Owner

G. Bruno De Luca

Used to record WKU's utility bills on a regular basis.

HINet: Half Instance Normalization Network for Image Restoration

ktrain is a Python library that makes deep learning and AI more accessible and easier to apply

Self Governing Neural Networks (SGNN): the Projection Layer

End-To-End Optimization of LiDAR Beam Configuration

SSD: Single Shot MultiBox Detector pytorch implementation focusing on simplicity

PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners for self-supervised ViT.

Churn-Prediction-Project - In this project, a churn prediction model is developed for a private bank as a term project for Data Mining class.

PyoMyo - Python Opensource Myo library

PaddleBoBo是基于PaddlePaddle和PaddleSpeech、PaddleGAN等开发套件的虚拟主播快速生成项目

Official implementation for "Symbolic Learning to Optimize: Towards Interpretability and Scalability"

Pytorch Geometric Tutorials

A 1.3B text-to-image generation model trained on 14 million image-text pairs

Unleashing Transformers: Parallel Token Prediction with Discrete Absorbing Diffusion for Fast High-Resolution Image Generation from Vector-Quantized Codes

Code for `BCD Nets: Scalable Variational Approaches for Bayesian Causal Discovery`, Neurips 2021

A Collection of LiDAR-Camera-Calibration Papers, Toolboxes and Notes

For AILAB: Cross Lingual Retrieval on Yelp Search Engine

Learning to Disambiguate Strongly Interacting Hands via Probabilistic Per-Pixel Part Segmentation [3DV 2021 Oral]

PyAF is an Open Source Python library for Automatic Time Series Forecasting built on top of popular pydata modules.

code for Image Manipulation Detection by Multi-View Multi-Scale Supervision