Overcoming-Catastrophic-forgetting-in-Neural-Networks

Elastic weight consolidation technique for incremental learning.

About

Use this API if you dont want your neural network to forget previously learnt tasks while doing transfer learning or domain adaption!

Results

The experiment is done as follow:

Train a 2 layer feed forward neural network on MNIST for 4 epochs
Train the same network later on Fashion-MNIST for 4 epochs This is done once with EWC and then without EWC and results are calculated on test data for both data on same model. Constant learning rate of 1e-4 is used throughout with Adam Optimizer. Importance multiplier is kept at 10e5 and sampling is done with half data before moving to next dataset

EWC	MNIST	Fashion-MNIST
Yes	70.27	81.88
No	48.43	86.69

Usage

from elastic_weight_consolidation import ElasticWeightConsolidation
# Build a neural network of your choice and pytorch dataset for it
# Define a criterion class for new task and pass it as shown below
ewc = ElasticWeightConsolidation(model, crit, lr=0.01, weight=0.1)
# Training procedure
for input, target in dataloader:
  ewc.forward_backward_update(input, target)
ewc.register_ewc_params(dataset, batch_size, num_batches_to_run_for_sampling)
# Repeat this for each new task and it's corresponding dataset

Reference

Paper

Elastic weight consolidation technique for incremental learning.

Related tags

Overview

Overcoming-Catastrophic-forgetting-in-Neural-Networks

About

Results

Usage

Reference

Owner

Shivam Saboo

An official implementation of the Anchor DETR.

This repository is an unoffical PyTorch implementation of Medical segmentation in 3D and 2D.

Decorator for PyMC3

Official code of the paper "Expanding Low-Density Latent Regions for Open-Set Object Detection" (CVPR 2022)

FocusFace: Multi-task Contrastive Learning for Masked Face Recognition

Official implementation for "Style Transformer for Image Inversion and Editing" (CVPR 2022)

Official code base for the poster "On the use of Cortical Magnification and Saccades as Biological Proxies for Data Augmentation" published in NeurIPS 2021 Workshop (SVRHM)

百度2021年语言与智能技术竞赛机器阅读理解Pytorch版baseline

RobustVideoMatting and background composing in one model by using onnxruntime.

An original implementation of "MetaICL Learning to Learn In Context" by Sewon Min, Mike Lewis, Luke Zettlemoyer and Hannaneh Hajishirzi

Implementation of CVPR'2022:Surface Reconstruction from Point Clouds by Learning Predictive Context Priors

Code of the paper "Shaping Visual Representations with Attributes for Few-Shot Learning (ASL)".

Optimizing synthesizer parameters using gradient approximation

WebUAV-3M: A Benchmark Unveiling the Power of Million-Scale Deep UAV Tracking

A task Provided by A respective Artenal Ai and Ml based Company to complete it

MINERVA: An out-of-the-box GUI tool for offline deep reinforcement learning

ROS Basics and TurtleSim

GndNet: Fast ground plane estimation and point cloud segmentation for autonomous vehicles using deep neural networks.

GarmentNets: Category-Level Pose Estimation for Garments via Canonical Space Shape Completion

HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement