PyTorch implementation of the supervised learning experiments from the paper Model-Agnostic Meta-Learning (MAML)

Last update: Jan 05, 2023

pytorch-maml

This is a PyTorch implementation of the supervised learning experiments from the paper Model-Agnostic Meta-Learning (MAML): https://arxiv.org/abs/1703.03400

Important: This code requires PyTorch v0.2.0, the latest version at the time of writing. Earlier versions raise errors about double backward not being supported.
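The double backward requirement comes from the MAML meta-gradient, which differentiates through the inner gradient step. The following is a minimal pure-Python sketch on a one-dimensional quadratic loss, not code from this repo; the function names and the parameters `alpha`, `support_target`, and `query_target` are illustrative assumptions. In PyTorch, the analogous step is computing the inner gradient with `torch.autograd.grad(..., create_graph=True)` so that the outer backward pass can differentiate through it.

```python
def inner_update(theta, support_target, alpha):
    """One gradient step on the support loss 0.5 * (theta - support_target)**2."""
    grad = theta - support_target
    return theta - alpha * grad


def query_loss(theta, support_target, query_target, alpha):
    """Query loss 0.5 * (adapted - query_target)**2 at the adapted parameter."""
    adapted = inner_update(theta, support_target, alpha)
    return 0.5 * (adapted - query_target) ** 2


def meta_gradient(theta, support_target, query_target, alpha, first_order=False):
    """Derivative of the query loss with respect to the initial parameter theta."""
    adapted = inner_update(theta, support_target, alpha)
    outer_grad = adapted - query_target
    if first_order:
        # First-order approximation: treat d(adapted)/d(theta) as 1.
        return outer_grad
    # Full meta-gradient: d(adapted)/d(theta) = 1 - alpha * H, where H is the
    # second derivative of the support loss (H = 1 for this quadratic).
    return outer_grad * (1.0 - alpha)
```

With `theta=2.0`, `support_target=0.0`, `query_target=1.0`, and `alpha=0.1`, the full meta-gradient is 0.72, while the first-order approximation gives 0.8. The difference is the second-derivative term that double backward provides.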

Currently, only the Omniglot experiments have been replicated here. The hyperparameters are the same as those used in the original TensorFlow implementation, except that only one random seed is used here.

5-way 1-shot training, best performance 98.9%

20-way 1-shot training, best performance 92%

Note: the 20-way performance is lower than that reported in the paper (they report 95.8%). If you can see why this might be, please let me know. This experiment also shows evidence of overfitting to the meta-training set.

The 5-shot results are obtained by meta-testing the network trained on the 1-shot task on the corresponding 5-shot task (e.g. for the 5-way 5-shot result, the network trained on 5-way 1-shot is tested with 5 shots). Again, the 20-way result is lower here than reported in the paper.
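To make the N-way K-shot terminology concrete, here is a small sketch of episode sampling using only the standard library. It is not the repo's data loader: `sample_episode`, `toy_data`, and the `n_query` parameter are hypothetical names introduced for illustration. It shows how the same 5-way problem can be sampled with 1 shot for training and 5 shots for meta-testing.

```python
import random


def sample_episode(examples_by_class, n_way, k_shot, n_query, rng):
    """Sample an N-way K-shot episode.

    Returns (support, query): lists of (example, episode_label) pairs, where
    episode_label is an index in range(n_way) local to this episode.
    """
    classes = rng.sample(sorted(examples_by_class), n_way)
    support, query = [], []
    for episode_label, cls in enumerate(classes):
        # Draw disjoint support and query examples for this class.
        picked = rng.sample(examples_by_class[cls], k_shot + n_query)
        support += [(x, episode_label) for x in picked[:k_shot]]
        query += [(x, episode_label) for x in picked[k_shot:]]
    return support, query


# Toy stand-in for a dataset: 10 classes with 20 examples each (hypothetical data).
toy_data = {c: [f"class{c}_img{i}" for i in range(20)] for c in range(10)}
rng = random.Random(0)

# Training setting: 5-way 1-shot.
support_1, query_1 = sample_episode(toy_data, n_way=5, k_shot=1, n_query=5, rng=rng)
# Meta-test setting: the same 5-way problem, but with 5 shots per class.
support_5, query_5 = sample_episode(toy_data, n_way=5, k_shot=5, n_query=5, rng=rng)
```

Only the number of support examples per class changes between the two calls; the network itself is the one trained in the 1-shot setting.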

This repo also contains code for running MAML experiments on permuted MNIST, where each task is created by shuffling the labels. This is a useful sanity-check task.
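One way to read "shuffling the labels" is that each task applies its own random permutation to the label set, so the model has to infer the mapping from the support examples. The sketch below assumes that reading; `make_label_permutation_task` and `relabel` are hypothetical helpers, not functions from this repo.

```python
import random


def make_label_permutation_task(num_classes, rng):
    """Return a dict mapping each original label to a task-specific label."""
    permuted = list(range(num_classes))
    rng.shuffle(permuted)
    return dict(enumerate(permuted))


def relabel(labels, mapping):
    """Apply a task's label permutation to a list of original labels."""
    return [mapping[y] for y in labels]


rng = random.Random(0)
task_mapping = make_label_permutation_task(10, rng)  # 10 digit classes
original_labels = [3, 3, 7, 0]
task_labels = relabel(original_labels, task_mapping)
```

Because the mapping is a bijection, examples that shared a label before relabeling still share one afterwards; only the label identity changes from task to task.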

license

This software is distributed under the MIT license.

to-do

port from PyTorch 0.2 to 0.4 and from Python 2 to Python 3
investigate the performance difference from the TensorFlow version
add a first-order version

Owner

Kate Rakelly
