A PyTorch library for Vision Transformers

Last update: Nov 28, 2022

Related tags

Deep Learning vformer

Overview

VFormer

A PyTorch library for Vision Transformers

Getting Started

Read the contributing guidelines in CONTRIBUTING.rst to learn how to start contributing.

Comments

Add attention visualization methods
This article details different ways of visualizing a transformer's attention. It also talks about how such visualizations can aid in explainability of the models.

They also provide their code here.

We would like to have such visualization methods in the viz module.

good first issue
opened by NeelayS 7
Remove _Projection class

We can replace _Projection class with a one-liner if-else statement.

Should we replace it with if-else or should we keep the current implementation?

cc: @NeelayS @aditya-agrawal-30502 @alvanli

opened by abhi-glitchhg 6
Enhanced docstring

During the last PR (#45), I had to revert back because of compatibility issues

In this PR I have added some docstrings and Minor changes like changing variable names

this PR is the same as - #48 with edited title :)

@NeelayS

opened by abhi-glitchhg 3
Restructuring AbsolutePositionEmbedding class

AbsolutePositionEmbedding class was structured specifically for the PVT, but we can use it in other models too if we re-structure it properly, it should also support sinusoidal position embedding or a separate class for Sinusoidal embedding also works.
enhancement

opened by abhi-glitchhg 2
Add sharpness-aware optimizer

This paper describes how promoting smoothness with a recently proposed sharpness-aware optimizer substantially improves the performance of ViTs.

It would be good to have an implementation of this optimizer in our library. It would fit in the functional module.

A couple of PyTorch implementations are here and here.

opened by NeelayS 2
Documentation related to visualization methods

I have added some fixes for page breaks in #86.

Still, we need to enhance the docs for visualization methods.
We can include the license/copyright disclaimer for visualization methods in our license or have a separate file.

Additionally, we can add the sample outputs from these methods into the doc.

CC : @NeelayS @aditya-agrawal-30502 @alvanli
documentation enhancement good first issue

opened by abhi-glitchhg 1
[Paper] Visual Attention Network

paper - https://arxiv.org/abs/2202.09741 code- https://github.com/Visual-Attention-Network/VAN-Classification https://github.com/Visual-Attention-Network/VAN-Segmentation
Paper implementation

opened by abhi-glitchhg 0

Releases(v0.1.3)

v0.1.3(Jul 3, 2022)

Source code(tar.gz)
Source code(zip)
v0.1.2(Apr 7, 2022)

Source code(tar.gz)
Source code(zip)
v0.1.0(Feb 9, 2022)

First release of VFormer!
Source code(tar.gz)
Source code(zip)

Owner

Society for Artificial Intelligence and Deep Learning

GitHub Repository

Speech Recognition is an important feature in several applications used such as home automation, artificial intelligence

Speech Recognition is an important feature in several applications used such as home automation, artificial intelligence, etc. This article aims to provide an introduction on how to make use of the S

1 Feb 13, 2022

Machine learning library for fast and efficient Gaussian mixture models

This repository contains code which implements the Stochastic Gaussian Mixture Model (S-GMM) for event-based datasets Dependencies CMake Premake4 Blaz

1 Dec 19, 2022

Official PyTorch Implementation of Mask-aware IoU and maYOLACT Detector [BMVC2021]

The official implementation of Mask-aware IoU and maYOLACT detector. Our implementation is based on mmdetection. Mask-aware IoU for Anchor Assignment

46 Sep 29, 2022

Optimized Gillespie algorithm for simulating Stochastic sPAtial models of Cancer Evolution (OG-SPACE)

OG-SPACE Introduction Optimized Gillespie algorithm for simulating Stochastic sPAtial models of Cancer Evolution (OG-SPACE) is a computational framewo

0 Nov 17, 2021

Original code for "Zero-Shot Domain Adaptation with a Physics Prior"

Zero-Shot Domain Adaptation with a Physics Prior [arXiv] [sup. material] - ICCV 2021 Oral paper, by Attila Lengyel, Sourav Garg, Michael Milford and J

40 Dec 21, 2022

Pre-Training Graph Neural Networks for Cold-Start Users and Items Representation.

Pretrain-Recsys This is our Tensorflow implementation for our WSDM 2021 paper: Bowen Hao, Jing Zhang, Hongzhi Yin, Cuiping Li, Hong Chen. Pre-Training

30 Nov 14, 2022

Deep Learning Theory

Deep Learning Theory 整理了一些深度学习的理论相关内容，持续更新。 Overview Recent advances in deep learning theory 总结了目前深度学习理论研究的六个方向的一些结果，概述型，没做深入探讨(2021)。 1.1 complexity

103 Jan 04, 2023

DP-CL(Continual Learning with Differential Privacy)

DP-CL(Continual Learning with Differential Privacy) This is the official implementation of the Continual Learning with Differential Privacy. If you us

3 Nov 04, 2022

Motion planning environment for Sampling-based Planners

Sampling-Based Motion Planners' Testing Environment Sampling-based motion planners' testing environment (sbp-env) is a full feature framework to quick

23 Aug 23, 2022

Semantic Segmentation with Pytorch-Lightning

This is a simple demo for performing semantic segmentation on the Kitti dataset using Pytorch-Lightning and optimizing the neural network by monitoring and comparing runs with Weights & Biases.

58 Nov 18, 2022

[NeurIPS 2021] A weak-shot object detection approach by transferring semantic similarity and mask prior.

TransMaS This repository is the official pytorch implementation of the following paper: NIPS2021 Mixed Supervised Object Detection by TransferringMask

49 Jul 27, 2022

[ICCV'21] Official implementation for the paper Social NCE: Contrastive Learning of Socially-aware Motion Representations

CrowdNav with Social-NCE This is an official implementation for the paper Social NCE: Contrastive Learning of Socially-aware Motion Representations by

125 Dec 23, 2022

Implementation of EMNLP 2017 Paper "Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog" using PyTorch and ParlAI

Language Emergence in Multi Agent Dialog Code for the Paper Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog Satwik Kottur, José M.

105 Nov 25, 2022

A PyTorch library for Vision Transformers

Related tags

Overview

VFormer

A PyTorch library for Vision Transformers

Getting Started

Comments

Add attention visualization methods

Remove _Projection class

Enhanced docstring

Restructuring AbsolutePositionEmbedding class

Add sharpness-aware optimizer

Documentation related to visualization methods

[Paper] Visual Attention Network

Releases(v0.1.3)

v0.1.3(Jul 3, 2022)

v0.1.2(Apr 7, 2022)

v0.1.0(Feb 9, 2022)

Owner

Society for Artificial Intelligence and Deep Learning

Speech Recognition is an important feature in several applications used such as home automation, artificial intelligence

Machine learning library for fast and efficient Gaussian mixture models

Official PyTorch Implementation of Mask-aware IoU and maYOLACT Detector [BMVC2021]

Optimized Gillespie algorithm for simulating Stochastic sPAtial models of Cancer Evolution (OG-SPACE)

Original code for "Zero-Shot Domain Adaptation with a Physics Prior"

Pre-Training Graph Neural Networks for Cold-Start Users and Items Representation.

Deep Learning Theory

DP-CL(Continual Learning with Differential Privacy)

Motion planning environment for Sampling-based Planners

Semantic Segmentation with Pytorch-Lightning

[NeurIPS 2021] A weak-shot object detection approach by transferring semantic similarity and mask prior.

[ICCV'21] Official implementation for the paper Social NCE: Contrastive Learning of Socially-aware Motion Representations

Implementation of EMNLP 2017 Paper "Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog" using PyTorch and ParlAI

classify fashion-mnist dataset with pytorch

Lua-parser-lark - An out-of-box Lua parser written in Lark

A visualisation tool for Deep Reinforcement Learning

OMLT: Optimization and Machine Learning Toolkit

A hybrid framework (neural mass model + ML) for SC-to-FC prediction

Head2Toe: Utilizing Intermediate Representations for Better OOD Generalization

Code Release for Learning to Adapt to Evolving Domains