Simple torch.nn.module implementation of Alias-Free-GAN style filter and resample

Last update: Dec 22, 2022

Overview

Alias-Free-Torch

Simple torch module implementation of Alias-Free GAN.

This repository including

Alias-Free GAN style lowpass sinc filter @filter.py
Alias-Free GAN style up/downsample @resample.py
Alias-Free activation @act.py
and test codes @./test

Note: Since this repository is unofficial, filter and upsample could be different with official implementation.

Note: 2d lowpass filter is applying sinc instead of jinc (first order Bessel function of the first kind) in paper

Requirements

Due to torch.kaiser_window and torch.i0 are implemeted after 1.7.0, our repository need torch>=1.7.0.

Pytorch>=1.7.0

TODO

2d sinc filter
2d resample
devide 1d and 2d modules
pip packaging

Test results 1d

Filter sine	Filter noise

upsample	downsample

Test results 2d

Filter L1 norm sine	Filter noise

upsample	downsample

Activation

References

Alias-Free GAN
adefossez/julius
A. V. Oppenheim and R. W. Schafer. Discrete-Time Signal Processing. Pearson, International Edition, 3rd edition, 2010

Acknowledgement

This work is done at MINDsLab Inc.

Thanks to teammates at MINDsLab Inc.

Comments

Batched resampling for the new implementation

Hi, thank you very much for the contribution.

I think the new implementation of resample.Upsample1d and resample.Downsample1d breaks batched resampling when using groups=C without expanding the filter to match the shape. Perhaps the implementation should be like the below (maybe similar goes to 2d):

Upsample1d.forward()

    # x: [B,C,T]
    def forward(self, x):
        B, C, T = x.shape
        x = F.pad(x, (self.pad, self.pad), mode='reflect')
        # TConv with filter expanded to C with C groups for depthwise op
        x = self.ratio * F.conv_transpose1d(
            x, self.filter.expand(C, -1, -1), stride=self.stride, groups=C)
        pad_left = self.pad * self.stride + (self.kernel_size -
                                             self.stride) // 2
        pad_right = self.pad * self.stride + (self.kernel_size - self.stride +
                                              1) // 2
        x = x[..., pad_left:-pad_right]

LowPassFilter1d.forward()

    #input [B,C,T]
    def forward(self, x):
        B, C, T = x.shape
        if self.padding:
            x = F.pad(x, (self.left_pad, self.right_pad),
                      mode=self.padding_mode)
        # Conv with filter expanded to C with C groups for depthwise op
        out = F.conv1d(x, self.filter.expand(C, -1, -1), stride=self.stride, groups=C) # typo 'groupds' btw
        return out

Could you check the correctness? Thanks again for the implementation!

opened by L0SG 2

torch.speical.i1 typo

https://github.com/junjun3518/alias-free-torch/blob/f1fddd52fdd068ee475e82ae60c92e1bc24ffe02/src/alias_free_torch/filter.py#L22

At this line I believe you wanted torch.special.i1.

opened by torridgristle 2
"if self.pad / self.padding" in LowPassFilter2d

https://github.com/junjun3518/alias-free-torch/blob/258551410ff7bf02e06ece7c597466dc970fe5c7/src/alias_free_torch/filter.py#L165 https://github.com/junjun3518/alias-free-torch/blob/258551410ff7bf02e06ece7c597466dc970fe5c7/src/alias_free_torch/filter.py#L173

In LowPassFilter2d it looks like if self.pad: should change to if self.padding:, or self.padding = padding should change to self.pad = padding to match LowPassFilter1d.

opened by torridgristle 1
Padding Bool typo

https://github.com/junjun3518/alias-free-torch/blob/258551410ff7bf02e06ece7c597466dc970fe5c7/src/alias_free_torch/filter.py#L73

padding: bool: True, should be padding: bool = True,

I'm not sure if this causes an error with every version of PyTorch, but it does with PyTorch 1.12.0+cu113 on Python 3.7.13

opened by torridgristle 1
2D Filter Jinc appears to be wrong

Here is a plot of the generated 1D sinc filter kernel.

Here is a plot of the generated 2D jinc filter kernel.

I'd expect it to look more like a series of rings or ripples, rather than a donut or torus.

The FFT output for randn noise put through the 2D filter doesn't look right either.

Changing filter_ = 2 * cutoff * window * jinc(2 * cutoff * time) to filter_ = 2 * cutoff * window * sinc(2 * cutoff * time) in kaiser_jinc_filter2d makes a more familiar kernel.

And the FFT output for randn noise put through this 2D filter looks about how I'd expect.

opened by torridgristle 3

Releases(v0.0.6)

v0.0.6(Jul 26, 2022)

https://pypi.org/project/alias-free-torch/0.0.6/

Tested version
Source code(tar.gz)
Source code(zip)
v0.0.3(Jul 18, 2022)

https://pypi.org/project/alias-free-torch/0.0.3/

Bug fix for torch.special / remove print / split pad from conv_transpose
Source code(tar.gz)
Source code(zip)
v0.0.2(Jun 22, 2022)

https://pypi.org/project/alias-free-torch/0.0.2/

Rewrite upsample, jinc applied
Source code(tar.gz)
Source code(zip)
v0.0.1(Nov 2, 2021)

v0.0.1 released https://pypi.org/project/alias-free-torch/
Source code(tar.gz)
Source code(zip)

Owner

이준혁(Junhyeok Lee)

Audio/Speech Deep Learning Researcher @mindslab-ai

GitHub Repository

The official implementation of Equalization Loss for Long-Tailed Object Recognition (CVPR 2020) based on Detectron2

Equalization Loss for Long-Tailed Object Recognition Jingru Tan, Changbao Wang, Buyu Li, Quanquan Li, Wanli Ouyang, Changqing Yin, Junjie Yan ⚠️ We re

197 Dec 25, 2022

Large scale PTM - PPI relation extraction

Large-scale protein-protein post-translational modification extraction with distant supervision and confidence calibrated BioBERT The silver standard

1 Feb 25, 2022

Code and model benchmarks for "SEVIR : A Storm Event Imagery Dataset for Deep Learning Applications in Radar and Satellite Meteorology"

NeurIPS 2020 SEVIR Code for paper: SEVIR : A Storm Event Imagery Dataset for Deep Learning Applications in Radar and Satellite Meteorology Requirement

46 Dec 15, 2022

Neural Turing Machines (NTM) - PyTorch Implementation

PyTorch Neural Turing Machine (NTM) PyTorch implementation of Neural Turing Machines (NTM). An NTM is a memory augumented neural network (attached to

519 Dec 21, 2022

Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".

PSLA: Improving Audio Tagging with Pretraining, Sampling, Labeling, and Aggregation Introduction Getting Started FSD50K Recipe AudioSet Recipe Label E

84 Dec 27, 2022

MODNet: Trimap-Free Portrait Matting in Real Time

MODNet is a model for real-time portrait matting with only RGB image input.

2.8k Dec 30, 2022

PED: DETR for Crowd Pedestrian Detection

PED: DETR for Crowd Pedestrian Detection Code for PED: DETR For (Crowd) Pedestrian Detection Paper PED: DETR for Crowd Pedestrian Detection Installati

36 Sep 13, 2022

[NeurIPS 2021] Garment4D: Garment Reconstruction from Point Cloud Sequences

Garment4D [PDF] | [OpenReview] | [Project Page] Overview This is the codebase for our NeurIPS 2021 paper Garment4D: Garment Reconstruction from Point

112 Dec 23, 2022

Official implementation of NeurIPS'2021 paper TransformerFusion

TransformerFusion: Monocular RGB Scene Reconstruction using Transformers Project Page | Paper | Video TransformerFusion: Monocular RGB Scene Reconstru

118 Dec 25, 2022

BisQue is a web-based platform designed to provide researchers with organizational and quantitative analysis tools for 5D image data. Users can extend BisQue by implementing containerized ML workflows.

Overview BisQue is a web-based platform specifically designed to provide researchers with organizational and quantitative analysis tools for up to 5D

26 Nov 29, 2022

시각 장애인을 위한 스마트 지팡이에 활용될 딥러닝 모델 (DL Model Repo)

SmartCane-DL-Model Smart Cane using semantic segmentation 참고한 Github repositoy 🔗 https://github.com/JunHyeok96/Road-Segmentation.git 데이터셋 🔗 https://

4 Dec 03, 2021

We will release the code of "ConTNet: Why not use convolution and transformer at the same time?" in this repo

ConTNet Introduction ConTNet (Convlution-Tranformer Network) is proposed mainly in response to the following two issues: (1) ConvNets lack a large rec

93 Nov 08, 2022

Semantically Contrastive Learning for Low-light Image Enhancement

Semantically Contrastive Learning for Low-light Image Enhancement Here, we propose an effective semantically contrastive learning paradigm for Low-lig

48 Dec 16, 2022

PaSST: Efficient Training of Audio Transformers with Patchout

PaSST: Efficient Training of Audio Transformers with Patchout This is the implementation for Efficient Training of Audio Transformers with Patchout Pa

165 Dec 26, 2022

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park, Rares Ambrus, Vitor Guizilini, Jie Li, and Adrien Gaidon.

DD3D: "Is Pseudo-Lidar needed for Monocular 3D Object detection?" Install // Datasets // Experiments // Models // License // Reference Full video Offi

364 Dec 27, 2022

Simple torch.nn.module implementation of Alias-Free-GAN style filter and resample

Related tags

Overview

Alias-Free-Torch

Requirements

TODO

Test results 1d

Test results 2d

References

Acknowledgement

Comments

Batched resampling for the new implementation

torch.speical.i1 typo

"if self.pad / self.padding" in LowPassFilter2d

Padding Bool typo

2D Filter Jinc appears to be wrong

Releases(v0.0.6)

v0.0.6(Jul 26, 2022)

v0.0.3(Jul 18, 2022)

v0.0.2(Jun 22, 2022)

v0.0.1(Nov 2, 2021)