Bottleneck Transformers for Visual Recognition

Last update: Jan 03, 2023

Overview

Bottleneck Transformers for Visual Recognition

Experiments

Model	Params (M)	Acc (%)
ResNet50 baseline (ref)	23.5M	93.62
BoTNet-50	18.8M	95.11%
BoTNet-S1-50	18.8M	95.67%
BoTNet-S1-59	27.5M	95.98%
BoTNet-S1-77	44.9M	wip

Summary

Usage (example)

Model

from model import Model

model = ResNet50(num_classes=1000, resolution=(224, 224))
x = torch.randn([2, 3, 224, 224])
print(model(x).size())

Module

from model import MHSA

resolution = 14
mhsa = MHSA(planes, width=resolution, height=resolution)

Reference

Paper link
Author: Aravind Srinivas, Tsung-Yi Lin, Niki Parmar, Jonathon Shlens, Pieter Abbeel, Ashish Vaswani
Organization: UC Berkeley, Google Research

Owner

Myeongjun Kim

Computer Vision Research using Deep Learning

GitHub Repository

E2e music remastering system - End-to-end Music Remastering System Using Self-supervised and Adversarial Training

End-to-end Music Remastering System This repository includes source code and pre

37 Dec 15, 2022

learned_optimization: Training and evaluating learned optimizers in JAX

learned_optimization: Training and evaluating learned optimizers in JAX learned_optimization is a research codebase for training learned optimizers. I

533 Dec 30, 2022

A TensorFlow 2.x implementation of Masked Autoencoders Are Scalable Vision Learners

Masked Autoencoders Are Scalable Vision Learners A TensorFlow implementation of Masked Autoencoders Are Scalable Vision Learners [1]. Our implementati

59 Dec 10, 2022

Official pytorch implementation of the paper: "SinGAN: Learning a Generative Model from a Single Natural Image"

SinGAN Project | Arxiv | CVF | Supplementary materials | Talk (ICCV`19) Official pytorch implementation of the paper: "SinGAN: Learning a Generative M

3.2k Dec 25, 2022

This repository holds code and data for our PETS'22 article 'From "Onion Not Found" to Guard Discovery'.

From "Onion Not Found" to Guard Discovery (PETS'22) This repository holds the code and data for our PETS'22 paper titled 'From "Onion Not Found" to Gu

3 May 04, 2022

Prototypical Networks for Few shot Learning in PyTorch

Prototypical Networks for Few shot Learning in PyTorch Simple alternative Implementation of Prototypical Networks for Few Shot Learning (paper, code)

835 Jan 08, 2023

Linear image-to-image translation

Linear (Un)supervised Image-to-Image Translation Examples for linear orthogonal transformations in PCA domain, learned without pairing supervision. Tr

40 Aug 31, 2022

Code for the paper "A Study of Face Obfuscation in ImageNet"

A Study of Face Obfuscation in ImageNet Code for the paper: A Study of Face Obfuscation in ImageNet Kaiyu Yang, Jacqueline Yau, Li Fei-Fei, Jia Deng,

35 Oct 04, 2022

PyTorch trainer and model for Sequence Classification

PyTorch-trainer-and-model-for-Sequence-Classification After cloning the repository, modify your training data so that the training data is a .csv file

2 Dec 09, 2022

Config files for my GitHub profile.

Canalyst Candas Data Science Library Name Canalyst Candas Description Built by a former PM / analyst to give anyone with a little bit of Python knowle

13 Jun 24, 2022

HNN: Human (Hollywood) Neural Network

HNN: Human (Hollywood) Neural Network Learn the top 1000 actors on IMDB with your very own low cost, highly parallel, CUDAless biological neural netwo

0 Dec 21, 2021

Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.

WECHSEL Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models. arXiv: https://arx

45 Dec 29, 2022

Bottleneck Transformers for Visual Recognition

Related tags

Overview

Bottleneck Transformers for Visual Recognition

Experiments

Summary

Usage (example)

Reference

Owner

Myeongjun Kim

E2e music remastering system - End-to-end Music Remastering System Using Self-supervised and Adversarial Training

learned_optimization: Training and evaluating learned optimizers in JAX

A TensorFlow 2.x implementation of Masked Autoencoders Are Scalable Vision Learners

Official pytorch implementation of the paper: "SinGAN: Learning a Generative Model from a Single Natural Image"

This repository holds code and data for our PETS'22 article 'From "Onion Not Found" to Guard Discovery'.

Prototypical Networks for Few shot Learning in PyTorch

Linear image-to-image translation

Code for the paper "A Study of Face Obfuscation in ImageNet"

PyTorch trainer and model for Sequence Classification

Config files for my GitHub profile.

HNN: Human (Hollywood) Neural Network

Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.

Framework for abstracting Amiga debuggers and access to AmigaOS libraries and devices.

Pytorch implementation of various High Dynamic Range (HDR) Imaging algorithms

Real-world Anomaly Detection in Surveillance Videos- pytorch Re-implementation

Bringing Computer Vision and Flutter together , to build an awesome app !!

The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.

A Broader Picture of Random-walk Based Graph Embedding

Apply AnimeGAN-v2 across frames of a video clip

Python Rapid Artificial Intelligence Ab Initio Molecular Dynamics