Canonical Capsules: Unsupervised Capsules in Canonical Pose (NeurIPS 2021)

Last update: Dec 07, 2022

Related tags

Overview

Canonical Capsules: Unsupervised Capsules in Canonical Pose (NeurIPS 2021)

Introduction

This is the official repository for the PyTorch implementation of "Canonical Capsules: Unsupervised Capsules in Canonical Pose" by Weiwei Sun*, Andrea Tagliasacchi*, Boyang Deng, Sara Sabour, Soroosh Yazdani, Geoffrey Hinton, Kwang Moo Yi.

Download links

Project Website
PDF (arXiv)
PDF (github copy)

Citation

⚠️ If you use this source core or data in your research (in any shape or format), we require you to cite our paper as:

@conference{sun2020canonical,
   title={Canonical Capsules: Unsupervised Capsules in Canonical Pose},
   author={Weiwei Sun and Andrea Tagliasacchi and Boyang Deng and 
           Sara Sabour and Soroosh Yazdani and Geoffrey Hinton and
           Kwang Moo Yi},
   booktitle={Neural Information Processing Systems},
   year={2021}
}

Requirements

Please install dependencies with the provided environment.yml:

conda env create -f environment.yml

Datasets

We use the ShapeNet dataset as in AtlasNetV2: download the data from AtlasNetV2's official repo and convert the downloaded data into h5 files with the provided script (i.e., data_utils/ShapeNetLoader.py).
For faster experimentation, please use our 2D planes dataset, which we generated from ShapeNet (please cite both our paper, as well as ShapeNet if you use this dataset).

Training/testing (2D)

To train the model on 2D planes (training of network takes only 50 epochs, and one epoch takes approximately 2.5 minutes on an NVIDIA GTX 1080 Ti):

./main.py --log_dir=plane_dim2 --indim=2 --scheduler=5

To visualize the decompostion and reconstruction:

./main.py --save_dir=gifs_plane2d --indim=2 --scheduler=5 --mode=vis --pt_file=logs/plane_dim2/checkpoint.pth

Training/testing (3D)

To train the model on the 3D dataset:

./main.py --log_dir=plane_dim3 --indim=3 --cat_id=-1

We test the model with:

./main.py --log_dir=plane_dim3 --indim=3 --cat_id=-1 --mode=test

Note that the option cat_id indicates the category id to be used to load the corresponding h5 files (this look-up table):

id	category
-1	all
0	bench
1	cabinet
2	car
3	cellphone
4	chair
5	couch
6	firearm
7	lamp
8	monitor
9	plane
10	speaker
11	table
12	watercraft

Pre-trained models (3D)

We release the 3D pretrained models for both single categy (airplanes), as well as multi-category (all 13 classes).

Classification

To use our classification script:

python classification.py --data_dir=/path/to/saved/features --feature_type=caca --method_type=svm --use_kpts

Canonical Capsules: Unsupervised Capsules in Canonical Pose (NeurIPS 2021)

Related tags

Overview

Canonical Capsules: Unsupervised Capsules in Canonical Pose (NeurIPS 2021)

Introduction

Download links

Citation

Requirements

Datasets

Training/testing (2D)

Training/testing (3D)

Pre-trained models (3D)

Classification

Owner

A Jinja extension (compatible with Flask and other frameworks) to compile and/or compress your assets.

Code and datasets for the paper "Combining Events and Frames using Recurrent Asynchronous Multimodal Networks for Monocular Depth Prediction" (RA-L, 2021)

OCR-D wrapper for detectron2 based segmentation models

This repository stores the code to reproduce the results published in "TiWS-iForest: Isolation Forest in Weakly Supervised and Tiny ML scenarios"

Code for models used in Bashiri et al., "A Flow-based latent state generative model of neural population responses to natural images".

TAP: Text-Aware Pre-training for Text-VQA and Text-Caption, CVPR 2021 (Oral)

A dead simple python wrapper for darknet that works with OpenCV 4.1, CUDA 10.1

🌎 The Modern Declarative Data Flow Framework for the AI Empowered Generation.

End-To-End Optimization of LiDAR Beam Configuration

Official implementation of Deep Burst Super-Resolution

A Deep Learning Framework for Neural Derivative Hedging

Generative Art Using Neural Visual Grammars and Dual Encoders

Does Pretraining for Summarization Reuqire Knowledge Transfer?

Sequence to Sequence Models with PyTorch

natural image generation using ConvNets

A JAX-based research framework for writing differentiable numerical simulators with arbitrary discretizations

[ICCV'21] Pri3D: Can 3D Priors Help 2D Representation Learning?

Official Repo for ICCV2021 Paper: Learning to Regress Bodies from Images using Differentiable Semantic Rendering

TransCD: Scene Change Detection via Transformer-based Architecture

SatelliteNeRF - PyTorch-based Neural Radiance Fields adapted to satellite domain