Implements Gradient Centralization in TensorFlow and makes it available as a Python package

Overview


This Python package implements Gradient Centralization in TensorFlow, a simple and effective optimization technique for deep neural networks proposed by Yong et al. in the paper Gradient Centralization: A New Optimization Technique for Deep Neural Networks. It can both speed up the training process and improve the final generalization performance of DNNs.
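
At its core, GC operates on any gradient tensor with rank greater than one, subtracting the mean computed over all axes except the last. A minimal sketch of this idea in TensorFlow (not necessarily the package's exact implementation):

import tensorflow as tf

def centralize_gradient(grad):
    # Centralize only weight matrices and convolution kernels;
    # rank-1 tensors such as biases pass through unchanged.
    if grad is not None and len(grad.shape) > 1:
        axes = list(range(len(grad.shape) - 1))
        grad -= tf.reduce_mean(grad, axis=axes, keepdims=True)
    return grad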

Installation

Run the following to install:

pip install gradient-centralization-tf

Usage

gctf.centralized_gradients_for_optimizer

Creates a centralized-gradients function for a specified optimizer.

Arguments:

  • optimizer: a tf.keras.optimizers.Optimizer object. The optimizer you are using.

Example:

>>> opt = tf.keras.optimizers.Adam(learning_rate=0.1)
>>> opt.get_gradients = gctf.centralized_gradients_for_optimizer(opt)
>>> model.compile(optimizer = opt, ...)

gctf.get_centralized_gradients

Computes the centralized gradients.

This function is not meant to be used directly unless you are building a custom optimizer, in which case you can point get_gradients to this function (a sketch of this appears below). It is a modified version of tf.keras.optimizers.Optimizer.get_gradients.

Arguments:

  • optimizer: a tf.keras.optimizers.Optimizer object. The optimizer you are using.
  • loss: Scalar tensor to minimize.
  • params: List of variables.

Returns:

A list of centralized gradient tensors.
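
For illustration, a custom optimizer could delegate its gradient computation to this function. A hedged sketch, assuming a TF 2.x Keras optimizer that still exposes the get_gradients method (the subclass name here is made up):

import tensorflow as tf
import gctf

class CentralizedAdam(tf.keras.optimizers.Adam):
    # Hypothetical subclass: route gradient computation through
    # gctf.get_centralized_gradients instead of the stock method.
    def get_gradients(self, loss, params):
        return gctf.get_centralized_gradients(self, loss, params)

>>> model.compile(optimizer = CentralizedAdam(learning_rate = 0.1), ...)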

gctf.optimizers

Pre-built optimizers updated to implement GC.

This module is specially built for trying out GC; in most cases you would use gctf.centralized_gradients_for_optimizer, which this module relies on under the hood. All the optimizers in tf.keras.optimizers can be used directly here, updated for GC.

Example:

>>> model.compile(optimizer = gctf.optimizers.adam(learning_rate = 0.01), ...)
>>> model.compile(optimizer = gctf.optimizers.rmsprop(learning_rate = 0.01, rho = 0.91), ...)
>>> model.compile(optimizer = gctf.optimizers.sgd(), ...)

Returns:

A tf.keras.optimizers.Optimizer object.
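
Putting it together, a short end-to-end sketch (the dataset and architecture below are arbitrary stand-ins, mirroring the repository's MNIST example):

import tensorflow as tf
import gctf

(x_train, y_train), _ = tf.keras.datasets.mnist.load_data()
x_train = x_train / 255.0  # scale pixel values to [0, 1]

model = tf.keras.models.Sequential([
    tf.keras.layers.Flatten(input_shape=(28, 28)),
    tf.keras.layers.Dense(128, activation='relu'),
    tf.keras.layers.Dense(10, activation='softmax')])

model.compile(optimizer = gctf.optimizers.adam(),
              loss = 'sparse_categorical_crossentropy',
              metrics = ['accuracy'])

model.fit(x_train, y_train, epochs = 5)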

Developing gctf

To install gradient-centralization-tf, along with the tools you need to develop and run tests, run the following in your virtualenv:

git clone git@github.com:Rishit-dagli/Gradient-Centralization-TensorFlow
# or clone your own fork

cd Gradient-Centralization-TensorFlow
pip install -e .[dev]

License

Copyright 2020 Rishit Dagli

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.

Comments
  • On Windows, TensorFlow 2.5 gives an error

    On Windows 10 with a miniconda environment, TensorFlow 2.5 gives an error in the centralized_gradients.py file.

    The solution is to change import keras.backend as K to import tensorflow.keras.backend as K, as sketched below.
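
    A sketch of the one-line change in centralized_gradients.py (surrounding code omitted):

    # Before (fails in this environment):
    # import keras.backend as K

    # After:
    import tensorflow.keras.backend as K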

    bug 
    opened by mgezer 5
  • The results in the mnist example are wrong/misleading

    Describe the bug: The results in your Colab notebook are misleading: https://colab.research.google.com/github/Rishit-dagli/Gradient-Centralization-TensorFlow/blob/main/examples/gctf_mnist.ipynb

    In this example, the model is first trained with a normal Adam optimizer:

    model.compile(optimizer = tf.keras.optimizers.Adam(),
                  loss = 'sparse_categorical_crossentropy',
                  metrics = ['accuracy'])
    
    history_no_gctf = model.fit(training_images, training_labels, epochs=5, callbacks = [time_callback_no_gctf])
    

    Afterwards, the same model is recompiled with gctf.optimizers.adam(). However, recompiling a Keras model does not reset its weights. This means the model is trained in the first fit call, and then the same, already-trained model is reused in the second fit call with the new optimizer, so of course the results are better.

    This can be fixed by recreating the model for the second run, adding just these few lines:

    import gctf #import gctf
    
    time_callback_gctf = TimeHistory()
    
    # Model architecture
    model = tf.keras.models.Sequential([
                                        tf.keras.layers.Flatten(), 
                                        tf.keras.layers.Dense(512, activation=tf.nn.relu),
                                        tf.keras.layers.Dense(256, activation=tf.nn.relu),
                                        tf.keras.layers.Dense(64, activation=tf.nn.relu),
                                        tf.keras.layers.Dense(512, activation=tf.nn.relu),
                                        tf.keras.layers.Dense(256, activation=tf.nn.relu),
                                        tf.keras.layers.Dense(64, activation=tf.nn.relu), 
                                        tf.keras.layers.Dense(10, activation=tf.nn.softmax)])
    
    model.compile(optimizer = gctf.optimizers.adam(),
                  loss = 'sparse_categorical_crossentropy',
                  metrics=['accuracy'])
    
    history_gctf = model.fit(training_images, training_labels, epochs=5, callbacks=[time_callback_gctf])
    

    However, the results are then no better than without gctf:

    Type                   Execution time    Accuracy      Loss
    -------------------  ----------------  ----------  --------
    Model without gctf:           24.7659    0.88825   0.305801
    Model with gctf               24.7881    0.889567  0.30812
    

    Could you please clarify what happens here? I tried the gctf.optimizers.adam() optimizer in my own research and it didn't change the results at all, and now I see that it doesn't work in the example constructed here either. This makes me question the results of this paper.

    To Reproduce: Execute the Colab notebook given in the repository: https://colab.research.google.com/github/Rishit-dagli/Gradient-Centralization-TensorFlow/blob/main/examples/gctf_mnist.ipynb

    Expected behavior: The right comparison would be for both models to start from a random initialization, not for the second model to start with already pre-trained weights.

    Looking forward to a swift explanation.

    Best, Max

    question 
    opened by themasterlink 2
  • Wider dependency requirements

    The package as of now requires tensorflow ~= 2.4.0 and keras ~= 2.4.0 to be installed. It turns out that this is sometimes problematic for folks who have custom installations of TensorFlow, and a wider requirement could be set up.

    enhancement 
    opened by Rishit-dagli 1
  • Release 0.0.3

    This release includes some fixes and improvements

    ✅ Bug Fixes / Improvements

    • Allow wider version ranges for TensorFlow and Keras when installing the package (#14)
    • Fix an incorrect usage example in the docstrings and description for centralized_gradients_for_optimizer (#13)
    • Add clear aims for each of the examples of using gctf (#15)
    • Update PyPI classifiers to clearly show the aims of this project; this changes nothing about how you use this package (#18)
    • Add clear instructions for using this with custom optimizers, i.e. directly using get_centralized_gradients; a complete example has not been pushed for the reasons mentioned in the issue (#16)
    opened by Rishit-dagli 0
  • Add an "About The Examples" section

    Add an "About The Examples" section which contains a summary of the usage example notebooks and links to run it on Binder and Colab.


    Close #15

    opened by Rishit-dagli 0
  • Update relevant PyPI classifiers

    Add PyPI classifiers for:

    • Development status
    • Intended Audience
    • Topic

    Further, I also added the Programming Language :: Python :: 3 :: Only classifier


    Closes #18

    opened by Rishit-dagli 0
  • Update PyPI classifiers

    I am specifically thinking of adding three more categories of PyPI classifiers:

    • Development status
    • Intended Audience
    • Topic

    Apart from this, I also think it would be great to add Programming Language :: Python :: 3 :: Only to make it clear that this package is intended for Python 3 only.

    opened by Rishit-dagli 0
  • Add an "About the examples" section

    It would be great to write an "About the examples" section which briefly demonstrates what the example notebooks aim to achieve and show.

    documentation 
    opened by Rishit-dagli 0
  • Error in usage example for gctf.centralized_gradients_for_optimizer

    I noticed that the docstrings for gctf.centralized_gradients_for_optimizer have an error in the example usage section. The example creates an Adam optimizer instance and saves it to opt; however, centralized_gradients_for_optimizer is applied to optimizer, which does not exist, so running the example results in an error.

    documentation 
    opened by Rishit-dagli 0
  • [ImgBot] Optimize images

    opened by imgbot[bot] 0
  • [ImgBot] Optimize images

    opened by imgbot[bot] 0
Releases(v0.0.3)
  • v0.0.3 (Mar 11, 2021)

    This release includes some fixes and improvements

    ✅ Bug Fixes / Improvements

    • Allow wider version ranges for TensorFlow and Keras when installing the package (#14)
    • Fix an incorrect usage example in the docstrings and description for centralized_gradients_for_optimizer (#13)
    • Add clear aims for each of the examples of using gctf (#15)
    • Update PyPI classifiers to clearly show the aims of this project; this changes nothing about how you use this package (#18)
    • Add clear instructions for using this with custom optimizers, i.e. directly using get_centralized_gradients; a complete example has not been pushed for the reasons mentioned in the issue (#16)
    Source code (tar.gz)
    Source code (zip)
  • v0.0.2 (Feb 21, 2021)

    This release includes some fixes and improvements

    ✅ Bug Fixes / Improvements

    • Fix the issue of supporting multiple modules
    • Fix multiple typos.
    Source code (tar.gz)
    Source code (zip)
  • v0.0.1 (Feb 20, 2021)

Owner
Rishit Dagli
High School, Ted-X, Ted-Ed speaker|Mentor, TFUG Mumbai|International Speaker|Microsoft Student Ambassador|#ExploreML Facilitator