Implementation of gMLP, an all-MLP replacement for Transformers, in Pytorch

Last update: Jan 02, 2023

Overview

gMLP - Pytorch

Install

$ pip install g-mlp-pytorch

Usage

For masked language modelling

import torch
from g_mlp_pytorch import gMLP

model = gMLP(
    num_tokens = 20000,
    dim = 512,
    depth = 6,
    seq_len = 256
)

x = torch.randint(0, 20000, (1, 256))
emb = model(x) # (1, 256, 512)

For image classification

import torch
from g_mlp_pytorch import gMLPVision

model = gMLPVision(
    image_size = 256,
    patch_size = 16,
    num_classes = 1000,
    dim = 512,
    depth = 6
)

img = torch.randn(1, 3, 256, 256)
pred = model(img) # (1, 1000)
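
As a rough sanity check on the arguments above, the sketch below works out how many patches a 256 x 256 RGB image yields at patch_size = 16. It assumes the image is split into non-overlapping square patches in the usual ViT style; patch_grid is a hypothetical helper and not part of g_mlp_pytorch.

```python
def patch_grid(image_size, patch_size, channels=3):
    """Return (num_patches, values_per_patch) for a square image.

    Assumes non-overlapping square patches. Illustration only,
    not a function from g_mlp_pytorch.
    """
    if image_size % patch_size != 0:
        raise ValueError("image_size must be divisible by patch_size")
    per_side = image_size // patch_size
    return per_side * per_side, channels * patch_size * patch_size


num_patches, patch_dim = patch_grid(256, 16)
print(num_patches, patch_dim)  # 256 768
```

Under that assumption, the model sees a sequence of 256 patch tokens, each flattened from 768 pixel values before being projected to dim.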

You can also add a small amount of single-head attention to boost performance, a variant the paper calls aMLP, by passing one extra keyword argument, attn_dim. This works for both gMLPVision and gMLP.

import torch
from g_mlp_pytorch import gMLPVision

model = gMLPVision(
    image_size = 256,
    patch_size = 16,
    num_classes = 1000,
    dim = 512,
    depth = 6,
    attn_dim = 64
)

img = torch.randn(1, 3, 256, 256)
pred = model(img) # (1, 1000)

Citations

@misc{liu2021pay,
    title   = {Pay Attention to MLPs}, 
    author  = {Hanxiao Liu and Zihang Dai and David R. So and Quoc V. Le},
    year    = {2021},
    eprint  = {2105.08050},
    archivePrefix = {arXiv},
    primaryClass = {cs.LG}
}

Comments

Custom image sizes?

Hi, thanks for your great (and very fast) contribution! I was wondering if you could help me figure out how to apply this to a different image size. It's not really an image, but rather a 2D tensor of size 4096 x 100.

I saw that I can change the number of channels, so I could just set channels to 1. But I see that, firstly, your implementation is for square images and, secondly, it requires the image size to be divisible by the patch size.

Since you've written this implementation, perhaps you could help me adapt it to my needs (and maybe help other users with their cases)?

Maybe I could pad the length to 128 so that both dimensions are divisible by 16, for example? But then where do I set a different h and w?

Thanks.

opened by danarte 3
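
One way to think about the padding idea raised above is to round each side up to the next multiple of the patch size. The sketch below is arithmetic only: pad_to_multiple is a hypothetical helper, and whether gMLPVision accepts a non-square (height, width) input is not confirmed here.

```python
def pad_to_multiple(size, multiple):
    """Round size up to the nearest multiple (hypothetical helper)."""
    return -(-size // multiple) * multiple


height, width, patch = 4096, 100, 16
padded_h = pad_to_multiple(height, patch)
padded_w = pad_to_multiple(width, patch)
num_patches = (padded_h // patch) * (padded_w // patch)
print(padded_h, padded_w, num_patches)  # 4096 112 1792
```

With a patch size of 16, padding the short side to 112 is already enough; padding to 128 would also work but adds one more column of patches.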

Parameter count doesn't line up with paper

Just a note (and correct me if I misunderstood the paper):

The parameter count for the Tiny gMLP doesn't line up with the count from the paper for 30 layers, dim 128, and ff_mult 6. That's probably due to the doubling of parameters here: https://github.com/lucidrains/g-mlp-pytorch/blob/main/g_mlp_pytorch/g_mlp_pytorch.py#L111

The fix is to halve this back to dim_ff, and all 3 lines here need to halve their respective dims: https://github.com/lucidrains/g-mlp-pytorch/blob/main/g_mlp_pytorch/g_mlp_pytorch.py#L64-L66

Then the param count is roughly 5.5M.

opened by titu1994 2
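
The effect described above can be estimated with a back-of-the-envelope count. The sketch below assumes a particular block layout (pre-LayerNorm, an input projection, a spatial gating unit on half of the hidden channels, and an output projection, all with biases) and a 224 x 224 image with 16 x 16 patches. That layout and the name block_params are assumptions for illustration, not read from the linked source, and the totals cover the blocks only (no patch embedding or classifier head).

```python
def block_params(dim, dim_ff, seq_len, doubled):
    """Parameter count of one gMLP block under an assumed layout.

    doubled=True models an input projection to 2 * dim_ff channels,
    doubled=False models a projection to dim_ff channels.
    """
    hidden = dim_ff * 2 if doubled else dim_ff
    half = hidden // 2                       # channels kept after the gate split
    pre_norm = 2 * dim                       # LayerNorm weight + bias
    proj_in = dim * hidden + hidden          # Linear(dim -> hidden)
    sgu_norm = 2 * half                      # LayerNorm on the gate half
    sgu_proj = seq_len * seq_len + seq_len   # spatial projection W, b
    proj_out = half * dim + dim              # Linear(half -> dim)
    return pre_norm + proj_in + sgu_norm + sgu_proj + proj_out


dim, ff_mult, depth = 128, 6, 30
seq_len = (224 // 16) ** 2  # assumed input: 224 x 224 image, 16 x 16 patches

for doubled in (True, False):
    total = depth * block_params(dim, dim * ff_mult, seq_len, doubled)
    print(doubled, total)
# True 10109400
# False 5639640
```

Under these assumptions the halved variant lands at about 5.6M parameters for the blocks alone, in the same range as the figure reported in the issue, while the doubled variant is close to twice that.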

Add Support for Stochastic Depth

This PR adds support for stochastic depth, which is used in the paper for the vision experiments. I went ahead and added it to gMLP as well for completeness.

I tried my best to match your style. Let me know if there are any problems, or if you want me to refactor anything.

opened by mlw214 2
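
For readers unfamiliar with the idea, one layer-dropping form of stochastic depth can be sketched as follows: during training, each layer is kept with some survival probability, and at least one layer is always retained. The helper sample_layers and the parameter name prob_survival are assumptions for illustration; the PR's actual mechanism is not shown on this page.

```python
import random


def sample_layers(layers, prob_survival, rng=random):
    """Keep each layer with probability prob_survival; never drop all of them."""
    if prob_survival >= 1.0:
        return list(layers)
    kept = [layer for layer in layers if rng.random() < prob_survival]
    if not kept:
        # fall back to a single randomly chosen layer
        kept = [rng.choice(layers)]
    return kept


layers = [f"block_{i}" for i in range(6)]
rng = random.Random(0)  # seeded so the run is repeatable
print(sample_layers(layers, 0.5, rng))
```

At evaluation time all layers would normally be used, which corresponds to calling the helper with prob_survival = 1.0.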

Don't you think this is more legible?

import torch
from torch import nn

def exists(val):
    return val is not None

class SpatialGatingUnit(nn.Module):
    def __init__(self, dim, dim_seq, causal = False, act = nn.Identity(), init_eps = 1e-3):
        super().__init__()
        dim_out = dim // 2
        self.causal = causal

        self.norm = nn.LayerNorm(dim_out)
        #self.proj = nn.Conv1d(dim_seq, dim_seq, 1)

        # explicit spatial projection parameters in place of the Conv1d above
        self.dim_seq = dim_seq
        self.w_ = nn.Parameter(torch.zeros(dim_seq, dim_seq), requires_grad=True)   ####
        self.b_ = nn.Parameter(torch.ones(dim_seq), requires_grad=True)  ####

        self.act = act

        # init_eps is only used by the commented-out Conv1d initialisation
        init_eps /= dim_seq
        #nn.init.uniform_(self.proj.weight, -init_eps, init_eps)
        #nn.init.constant_(self.proj.bias, 1.)

    def forward(self, x, gate_res = None): # x -> bsz, len, hidden*6
        device, n = x.device, x.shape[1]

        res, gate = x.chunk(2, dim = -1)
        gate = self.norm(gate)

        weight, bias = self.w_, self.b_ # weight -> len, len     bias -> len

        if self.causal:
            # crop to the current length and zero out connections to future positions
            weight, bias = weight[:n, :n], bias[:n]
            mask = torch.ones(weight.shape[:2], device = device).triu_(1).bool()
            weight = weight.masked_fill(mask, 0.)

        gate = torch.matmul(weight, gate) + bias[None, :self.dim_seq, None]   # WZ + b

        #gate = F.conv1d(gate, weight, bias)   # WZ + b

        if exists(gate_res):
            gate = gate + gate_res

        return self.act(gate) * res

opened by ZIZUN 0

Potentially missing the highway path

Hello,

Maybe I missed it, but would you mind pointing out where the highway path of the gMLP block is in the code? Based on the paper, there is a highway path (an addition) between the input and the output. I couldn't find it in the gMLPBlock code.

Thank you

opened by Vincent-Li-9701 1
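
For context, a skip connection is often implemented as a wrapper around a block rather than inside the block itself, which can make it easy to miss when reading the block's code. The sketch below shows that pattern with plain Python lists; Residual and block are illustrative names and are not claimed to match this repository's code.

```python
class Residual:
    """Wrap a function so its output is added back to its input."""

    def __init__(self, fn):
        self.fn = fn

    def __call__(self, x):
        # element-wise x + fn(x): the skip ("highway") path
        return [a + b for a, b in zip(x, self.fn(x))]


def block(x):
    # stand-in for the body of a gMLP block (hypothetical)
    return [2.0 * v for v in x]


layer = Residual(block)
print(layer([1.0, 2.0, 3.0]))  # [3.0, 6.0, 9.0]
```

With this pattern, the block body contains no addition at all; the sum with the input happens where the layers are assembled.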

Releases(0.1.5)

0.1.5 (Aug 14, 2021)
0.1.4 (Aug 14, 2021)
0.1.2 (Aug 13, 2021)
0.1.1 (Aug 13, 2021)
0.1.0 (Aug 13, 2021)
0.0.18 (Jun 11, 2021)
0.0.17 (Jun 11, 2021)
0.0.16 (May 23, 2021)
0.0.15 (May 23, 2021)
0.0.14 (May 20, 2021)
0.0.12 (May 19, 2021)
0.0.11 (May 19, 2021)
0.0.10 (May 19, 2021)
0.0.9 (May 19, 2021)
0.0.8 (May 19, 2021)
0.0.7 (May 19, 2021)
0.0.6 (May 19, 2021)
0.0.5a (May 19, 2021)
0.0.4 (May 18, 2021)
0.0.3 (May 18, 2021)
0.0.2 (May 18, 2021)
0.0.1 (May 18, 2021)

Owner

Phil Wang

Working with Attention. It's all we need.

