Code for Piggyback: Adapting a Single Network to Multiple Tasks by Learning to Mask Weights

Last update: Nov 22, 2022

Related tags

Overview

Piggyback: https://arxiv.org/abs/1801.06519

Pretrained masks and backbones are available here: https://uofi.box.com/s/c5kixsvtrghu9yj51yb1oe853ltdfz4q

Datasets in PyTorch format are available here: https://uofi.box.com/s/ixncr3d85guosajywhf7yridszzg5zsq
All rights belong to the respective publishers. The datasets are provided only to aid reproducibility.

The PyTorch-friendly Places365 dataset can be downloaded from http://places2.csail.mit.edu/download.html

Place masks in checkpoints/ and unzipped datasets in data/

	VGG-16	ResNet-50	DenseNet-121
CUBS	20.75	18.23	19.24
Stanford Cars	11.78	10.19	10.62
Flowers	6.93	4.77	4.91
WikiArt	29.80	28.57	29.33
Sketch	22.30	19.75	20.05

Note that the numbers in the paper are averaged over multiple runs for each ordering of datasets. These numbers were obtained by evaluating the models on a Titan X (Pascal). Note that numbers on other GPUs might be slightly different (~0.1%) owing to cudnn algorithm selection. https://discuss.pytorch.org/t/slightly-different-results-on-k-40-v-s-titan-x/10064

Requirements:

Python 2.7 or 3.xx
torch==0.2.0.post3
torchvision==0.1.9
torchnet (pip install git+https://github.com/pytorch/[email protected])
tqdm (pip install tqdm)

Run all code from the src/ directory, e.g. ./scripts/run_piggyback_training.sh

Training:

Check out src/scripts/run_piggyback_training.sh.

This script uses the default hyperparams and trains a model as described in the paper. The best performing model on the val set is saved to disk. This saved model includes the real-valued mask weights.

By default, we use the models provided by torchvision as our backbone networks. If you intend to evaluate with the masks provided by us, please use the correct version of torch and torchvision. In case you want to use a different version, but still want to use our masks, then download the pytorch_backbone networks provided in the box link above. Make appropriate changes to your pytorch code to load those backbone models.

Saving trained masks only.

Check out src/scripts/run_packing.sh.

This extracts the binary/ternary masks from the above trained models, and saves them separately.

Eval:

Use the saved masks, apply them to a backbone network and run eval.

By default, our backbone models are those provided with torchvision.
Note that to replicate our results, you have to use the package versions specified above.
Newer package versions might have different weights for the backbones, and the provided masks won't work.

cd src  # Run everything from src/

CUDA_VISIBLE_DEVICES=0 python pack.py --mode eval --dataset flowers \
  --arch vgg16 \
  --maskloc ../checkpoints/vgg16_binary.pt

Code for Piggyback: Adapting a Single Network to Multiple Tasks by Learning to Mask Weights

Related tags

Overview

Piggyback: https://arxiv.org/abs/1801.06519

Requirements:

Training:

Saving trained masks only.

Eval:

Owner

Arun Mallya

particle tracking model, works with the ROMS output file(qck.nc, his.nc)

GBK-GNN: Gated Bi-Kernel Graph Neural Networks for Modeling Both Homophily and Heterophily

Benchmark tools for Compressive LiDAR-to-map registration

Ensemble Visual-Inertial Odometry (EnVIO)

A PyTorch implementation for V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation

Multi-angle c(q)uestion answering

Diabet Feature Engineering - Predict whether people have diabetes when their characteristics are specified

Pgn2tex - Scripts to convert pgn files to latex document. Useful to build books or pdf from pgn studies

Direct design of biquad filter cascades with deep learning by sampling random polynomials.

Simple helper library to convert a collection of numpy data to tfrecord, and build a tensorflow dataset from the tfrecord.

CycleTransGAN-EVC: A CycleGAN-based Emotional Voice Conversion Model with Transformer

😇A pyTorch implementation of the DeepMoji model: state-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm etc

A curated list of awesome papers for Semantic Retrieval (TOIS Accepted: Semantic Models for the First-stage Retrieval: A Comprehensive Review).

一个目标检测的通用框架(不需要cuda编译)，支持Yolo全系列(v2~v5)、EfficientDet、RetinaNet、Cascade-RCNN等SOTA网络。

Level Based Customer Segmentation

Energy consumption estimation utilities for Jetson-based platforms

시각 장애인을 위한 스마트 지팡이에 활용될 딥러닝 모델 (DL Model Repo)

[CVPR'21] Learning to Recommend Frame for Interactive Video Object Segmentation in the Wild

Learning Features with Parameter-Free Layers (ICLR 2022)

SweiNet is an uncertainty-quantifying shear wave speed (SWS) estimator for ultrasound shear wave elasticity (SWE) imaging.