Pervasive Attention: 2D Convolutional Networks for Sequence-to-Sequence Prediction

Last update: Dec 15, 2022

Overview

This is a fork of Fairseq(-py) with implementations of the following models:

Pervasive Attention - 2D Convolutional Neural Networks for Sequence-to-Sequence Prediction

An NMT models with two-dimensional convolutions to jointly encode the source and the target sequences.

Pervasive Attention also provides an extensive decoding grid that we leverage to efficiently train wait-k models.

See README.

Efficient Wait-k Models for Simultaneous Machine Translation

Transformer Wait-k models (Ma et al., 2019) with unidirectional encoders and with joint training of multiple wait-k paths.

See README.

Fairseq Requirements and Installation

PyTorch version >= 1.4.0
Python version >= 3.6
For training new models, you'll also need an NVIDIA GPU and NCCL

Installing Fairseq

git clone https://github.com/elbayadm/attn2d
cd attn2d
pip install --editable .

License

fairseq(-py) is MIT-licensed. The license applies to the pre-trained models as well.

Citation

For Pervasive Attention, please cite:

@InProceedings{elbayad18conll,
    author ="Elbayad, Maha and Besacier, Laurent and Verbeek, Jakob",
    title = "Pervasive Attention: 2D Convolutional Neural Networks for Sequence-to-Sequence Prediction",
    booktitle = "Proceedings of the 22nd Conference on Computational Natural Language Learning",
    year = "2018",
 }

For our wait-k models, please cite:

@article{elbayad20waitk,
    title={Efficient Wait-k Models for Simultaneous Machine Translation},
    author={Elbayad, Maha and Besacier, Laurent and Verbeek, Jakob},
    journal={arXiv preprint arXiv:2005.08595},
    year={2020}
}

For Fairseq, please cite:

@inproceedings{ott2019fairseq,
  title = {fairseq: A Fast, Extensible Toolkit for Sequence Modeling},
  author = {Myle Ott and Sergey Edunov and Alexei Baevski and Angela Fan and Sam Gross and Nathan Ng and David Grangier and Michael Auli},
  booktitle = {Proceedings of NAACL-HLT 2019: Demonstrations},
  year = {2019},
}

Pervasive Attention: 2D Convolutional Networks for Sequence-to-Sequence Prediction

Related tags

Overview

Pervasive Attention - 2D Convolutional Neural Networks for Sequence-to-Sequence Prediction

Efficient Wait-k Models for Simultaneous Machine Translation

Fairseq Requirements and Installation

License

Citation

Owner

Maha

Tensors and neural networks in Haskell

Starter kit for getting started in the Music Demixing Challenge.

Learn other languages using artificial intelligence with python.

CausaLM: Causal Model Explanation Through Counterfactual Language Models

Convolutional neural network that analyzes self-generated images in a variety of languages to find etymological similarities

Disentangled Cycle Consistency for Highly-realistic Virtual Try-On, CVPR 2021

A repository for storing njxzc final exam review material

neural image generation

Self-Supervised Image Denoising via Iterative Data Refinement

An onlinel learning to rank python codebase.

A simple and extensible library to create Bayesian Neural Network layers on PyTorch.

基于Flask开发后端、VUE开发前端框架，在WEB端部署YOLOv5目标检测模型

Neural-Pull: Learning Signed Distance Functions from Point Clouds by Learning to Pull Space onto Surfaces(ICML 2021)

Provided is code that demonstrates the training and evaluation of the work presented in the paper: "On the Detection of Digital Face Manipulation" published in CVPR 2020.

Source code for "UniRE: A Unified Label Space for Entity Relation Extraction.", ACL2021.

Official code for the ICCV 2021 paper "DECA: Deep viewpoint-Equivariant human pose estimation using Capsule Autoencoders"

The project page of paper: Architecture disentanglement for deep neural networks [ICCV 2021, oral]

Official implementation of our paper "Learning to Bootstrap for Combating Label Noise"

Official PyTorch implemention of our paper "Learning to Rectify for Robust Learning with Noisy Labels".

Offcial implementation of "A Hybrid Video Anomaly Detection Framework via Memory-Augmented Flow Reconstruction and Flow-Guided Frame Prediction, ICCV-2021".

Pervasive Attention: 2D Convolutional Networks for Sequence-to-Sequence Prediction

Related tags

Overview

Pervasive Attention - 2D Convolutional Neural Networks for Sequence-to-Sequence Prediction

Efficient Wait-k Models for Simultaneous Machine Translation

Fairseq Requirements and Installation

License

Citation

Owner

Maha

Tensors and neural networks in Haskell

Starter kit for getting started in the Music Demixing Challenge.

Learn other languages ​​using artificial intelligence with python.

CausaLM: Causal Model Explanation Through Counterfactual Language Models

Convolutional neural network that analyzes self-generated images in a variety of languages to find etymological similarities

Disentangled Cycle Consistency for Highly-realistic Virtual Try-On, CVPR 2021

A repository for storing njxzc final exam review material

neural image generation

Self-Supervised Image Denoising via Iterative Data Refinement

An onlinel learning to rank python codebase.

A simple and extensible library to create Bayesian Neural Network layers on PyTorch.

基于Flask开发后端、VUE开发前端框架，在WEB端部署YOLOv5目标检测模型

Neural-Pull: Learning Signed Distance Functions from Point Clouds by Learning to Pull Space onto Surfaces(ICML 2021)

Provided is code that demonstrates the training and evaluation of the work presented in the paper: "On the Detection of Digital Face Manipulation" published in CVPR 2020.

Source code for "UniRE: A Unified Label Space for Entity Relation Extraction.", ACL2021.

Official code for the ICCV 2021 paper "DECA: Deep viewpoint-Equivariant human pose estimation using Capsule Autoencoders"

The project page of paper: Architecture disentanglement for deep neural networks [ICCV 2021, oral]

Official implementation of our paper "Learning to Bootstrap for Combating Label Noise"

Official PyTorch implemention of our paper "Learning to Rectify for Robust Learning with Noisy Labels".

Offcial implementation of "A Hybrid Video Anomaly Detection Framework via Memory-Augmented Flow Reconstruction and Flow-Guided Frame Prediction, ICCV-2021".

Learn other languages using artificial intelligence with python.