Non-Autoregressive Predictive Coding

This repository contains the implementation of Non-Autoregressive Predictive Coding (NPC) as described in the preprint paper submitted to ICASSP 2021.

A quick example for training NPC

python main.py --config config/self_supervised/npc_example.yml \
               --task self-learning

For more complete examples including downstream tasks, please see the example script.
For preparing data, please visit preprocess.
For detailed hyperparameters setting and description, please checkout example config file of NPC.
For all run-time options, use -h flag.
Implementation of Autoregressive Predictive Coding (APC, 2019, Chung et al.) and Vector-Quantized APC (VQ-APC, 2020, Chung et al.) are also available using similar training/downstream execution with example config files here.

Some notes

We found the unmasked feature produced by the last ConvBlock layer a better representation. In the phone classification tasks, switching to the unmasked feature (PER 25.6%) provided a 1.6% improvement over the masked feature (PER 27.2%). Currently, this is not included in the preprint version and will be updated to the paper in the future. Please refer to downstream examples to activate this option.
APC/VQ-APC are implemented with the following modifications for improvement (for the unmodified version, please visit the official implementation of APC / VQAPC)
- Multi-group VQ available for VQ-APC, but with VQ on last layer only
- Using utterance-wised CMVN surface feature（just as NPC did)
- Using Gumbel Softmax from official API of pytorch
See package requirement for toolkits used, tensorboard can be used to access log files in --logdir.

Contact

Feel free to contact me for questions or feedbacks, my email can be found in the paper or my personal page.

Citation

If you find our work and/or this repository helpful, please do consider citing us

@article{liu2020nonautoregressive,
  title   = {Non-Autoregressive Predictive Coding for Learning Speech Representations from Local Dependencies},
  author  = {Liu, Alexander and Chung, Yu-An and Glass, James},
  journal = {arXiv preprint arXiv:2011.00406},
  year    = {2020}
}

Non-Autoregressive Predictive Coding

Related tags

Overview

Non-Autoregressive Predictive Coding

Some notes

Contact

Citation

Owner

Alexander H. Liu

Th2En & Th2Zh: The large-scale datasets for Thai text cross-lingual summarization

Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": UNITER adversarial training part

FireFlyer Record file format, writer and reader for DL training samples.

spaCy plugin for Transformers , Udify, ELmo, etc.

DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference

A pytorch implementation of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".

GAP-text2SQL: Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training

Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

:hot_pepper: R²SQL: "Dynamic Hybrid Relation Network for Cross-Domain Context-Dependent Semantic Parsing." (AAAI 2021)

Contact Extraction with Question Answering.

Code for the ACL 2021 paper "Structural Guidance for Transformer Language Models"

End-to-end text to speech system using gruut and onnx. There are 40 voices available across 8 languages.

voice2json is a collection of command-line tools for offline speech/intent recognition on Linux

PyTorch implementation of NATSpeech: A Non-Autoregressive Text-to-Speech Framework

The official repository of the ISBI 2022 KNIGHT Challenge

Few-shot Natural Language Generation for Task-Oriented Dialog

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Code for producing Japanese GPT-2 provided by rinna Co., Ltd.

Code for the paper "VisualBERT: A Simple and Performant Baseline for Vision and Language"

Index different CKAN entities in Solr, not just datasets