Implementation of Cross Transformer for spatially-aware few-shot transfer, in PyTorch

Last update: Dec 22, 2022

Overview

Cross Transformers - Pytorch (wip)

Install

$ pip install cross-transformers-pytorch

Usage

import torch
from torch import nn
import torch.nn.functional as F
from torchvision import models
from cross_transformers_pytorch import CrossTransformer

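# ResNet-34 trunk with the final average pool and fully connected layers removed,
# so it returns a spatial feature map with 512 channels (matching dim below)
resnet = models.resnet34(pretrained = True)
model = nn.Sequential(*[*resnet.children()][:-2])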

cross_transformer = CrossTransformer(
    dim = 512,
    dim_key = 128,
    dim_value = 128
)

# (batch, channels, height, width)
img_query = torch.randn(1, 3, 224, 224)

# (batch, classes, num supports, channels, height, width)
img_supports = torch.randn(1, 2, 4, 3, 224, 224)

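# ground-truth class index for each query image, shape (batch,)
labels = torch.randint(0, 2, (1,))

# one score per class for each query image: (batch, classes) = (1, 2)
dists = cross_transformer(model, img_query, img_supports)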

loss = F.cross_entropy(dists, labels)
loss.backward()
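
The usage above treats CrossTransformer as a black box. The following is a rough sketch of the spatial cross-attention idea from the cited paper, not this package's actual code. The helper name `cross_attention_scores` and the projection matrices `w_qk` and `w_v` are hypothetical, and sharing one projection between queries and keys is an assumption. The inputs stand in for backbone features: for a 224x224 image, the truncated ResNet-34 above should yield a 512-channel 7x7 map.

```python
import torch


def cross_attention_scores(query_feats, support_feats, w_qk, w_v):
    """Hypothetical helper: score one query against each class's supports.

    query_feats:   (batch, dim, h, w) backbone features of the query image
    support_feats: (batch, classes, supports, dim, h, w) backbone features
    w_qk, w_v:     (dim, dim_key) and (dim, dim_value) projection matrices
    Returns (batch, classes) negative mean squared distances, usable as logits.
    """
    b, k, n, d, h, w = support_feats.shape

    # Flatten spatial positions so that every location becomes one token.
    q_flat = query_feats.flatten(2).transpose(1, 2)            # (b, h*w, d)
    s_flat = (support_feats.flatten(4)                         # (b, k, n, d, h*w)
              .permute(0, 1, 2, 4, 3)                          # (b, k, n, h*w, d)
              .reshape(b, k, n * h * w, d))                    # (b, k, n*h*w, d)

    q = q_flat @ w_qk          # (b, h*w, dim_key)
    q_val = q_flat @ w_v       # (b, h*w, dim_value)
    keys = s_flat @ w_qk       # (b, k, n*h*w, dim_key)
    vals = s_flat @ w_v        # (b, k, n*h*w, dim_value)

    # Each query location attends over all locations of all supports in a class.
    sim = torch.einsum('bqc,bksc->bkqs', q, keys) * w_qk.shape[1] ** -0.5
    attn = sim.softmax(dim=-1)                                 # (b, k, h*w, n*h*w)

    # Query-aligned prototype for every class and query location.
    proto = attn @ vals                                        # (b, k, h*w, dim_value)

    # Squared Euclidean distance, averaged over query locations.
    dist = ((q_val.unsqueeze(1) - proto) ** 2).sum(dim=-1).mean(dim=-1)
    return -dist                                               # (b, k)


torch.manual_seed(0)
query = torch.randn(1, 512, 7, 7)
supports = torch.randn(1, 2, 4, 512, 7, 7)
w_qk = torch.randn(512, 128) * 512 ** -0.5
w_v = torch.randn(512, 128) * 512 ** -0.5

scores = cross_attention_scores(query, supports, w_qk, w_v)
print(scores.shape)  # torch.Size([1, 2])
```

Because the result is a negated distance, a larger value means the query lies closer to that class's prototype, which is why a (batch, classes) tensor of this kind can be passed to a cross-entropy loss as logits.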

Citations

@misc{doersch2020crosstransformers,
    title={CrossTransformers: spatially-aware few-shot transfer}, 
    author={Carl Doersch and Ankush Gupta and Andrew Zisserman},
    year={2020},
    eprint={2007.11498},
    archivePrefix={arXiv},
    primaryClass={cs.CV}
}

Releases (0.0.2)

0.0.2 (Mar 30, 2021)

0.0.1 (Dec 16, 2020)

Owner

Phil Wang