This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).

Last update: Dec 05, 2022

Overview

Non-autoregressive Deep Learning-Based TTS Template

This is a template for the Non-autoregressive TTS model. It contains

Data Preprocessing Pipeline
Data Loader
Model / Trainer
Logger, Postprocessing (logging, synthesizing, plotting, etc..)

How to use it?

Clone the repository.

git clone https://github.com/keonlee9420/Deep-Learning-TTS-Template
cd Deep-Learning-TTS-Template

Replace all MYMODEL strings in this repo with your model name and also rename the file model/MYMODEL.py.
Build your model on model/ and check train.py and synthesize.py.
Use README_template.md for the README.md file of your project.
Feel free to add /img for your model architecture and tensorboard examples. It would also be nice to show your model's output audio in /demo.
Don't forget to update requirements.txt and /config of your project.

Citation

@misc{lee2021deep_learning_tts_template,
  author = {Lee, Keon},
  title = {Deep-Learning-TTS-Template},
  year = {2021},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/keonlee9420/Deep-Learning-TTS-Template}}
}

References

ming024's FastSpeech2

You might also like...

Pytorch Implementation of DiffSinger: Diffusion Acoustic Model for Singing Voice Synthesis (TTS Extension)

DiffSinger - PyTorch Implementation PyTorch implementation of DiffSinger: Diffusion Acoustic Model for Singing Voice Synthesis (TTS Extension). Status

152 Jan 2, 2023

Pytorch implementation of "Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech"

GradTTS Unofficial Pytorch implementation of "Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech" (arxiv) About this repo This is an unoffic

103 Dec 23, 2022

A non-linear, non-parametric Machine Learning method capable of modeling complex datasets

Fast Symbolic Regression Symbolic Regression is a non-linear, non-parametric Machine Learning method capable of modeling complex data sets. fastsr aim

3 Jun 22, 2022

A Robust Non-IoU Alternative to Non-Maxima Suppression in Object Detection

Confluence: A Robust Non-IoU Alternative to Non-Maxima Suppression in Object Detection 1. 介绍用以替代 NMS，在所有 bbox 中挑选出最优的集合。 NMS 仅考虑了 bbox 的得分，然后根据 IOU 来

44 Sep 15, 2022

This project uses Template Matching technique for object detecting by detection of template image over base image.

Object Detection Project Using OpenCV This project uses Template Matching technique for object detecting by detection the template image over base ima

7 May 29, 2022

This project uses Template Matching technique for object detecting by detection of template image over base image

Object Detection Project Using OpenCV This project uses Template Matching technique for object detecting by detection the template image over base ima

4 Nov 16, 2021

Curvlearn, a Tensorflow based non-Euclidean deep learning framework.

English | 简体中文 Why Non-Euclidean Geometry Considering these simple graph structures shown below. Nodes with same color has 2-hop distance whereas 1-ho

123 Dec 12, 2022

ESGD-M - A stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch

53 Dec 29, 2022

Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.

One model to speak them all 🌎 Audio Language Text ▷ Chinese 人人生而自由，在尊严和权利上一律平等。 ▷ English All human beings are born free and equal in dignity and rig

60 Nov 14, 2022

This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).

Related tags

Overview

Non-autoregressive Deep Learning-Based TTS Template

How to use it?

Citation

References

You might also like...

Pytorch Implementation of DiffSinger: Diffusion Acoustic Model for Singing Voice Synthesis (TTS Extension)

Pytorch implementation of "Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech"

A non-linear, non-parametric Machine Learning method capable of modeling complex datasets

A Robust Non-IoU Alternative to Non-Maxima Suppression in Object Detection

This project uses Template Matching technique for object detecting by detection of template image over base image.

This project uses Template Matching technique for object detecting by detection of template image over base image

Curvlearn, a Tensorflow based non-Euclidean deep learning framework.

ESGD-M - A stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch

Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.

Releases(v1.0.0)

v1.0.0(Jun 15, 2021)

Owner

Keon Lee

Neural Scene Flow Prior (NeurIPS 2021 spotlight)

Implementation of Graph Transformer in Pytorch, for potential use in replicating Alphafold2

Leaf: Multiple-Choice Question Generation

Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)

Search and filter videos based on objects that appear in them using convolutional neural networks

This repository contains the entire code for our work "Two-Timescale End-to-End Learning for Channel Acquisition and Hybrid Precoding"

Using pretrained GROVER to extract the atomic fingerprints from molecule

[CVPR2021] De-rendering the World's Revolutionary Artefacts

PyTorch implementation of ICLR 2022 paper PiCO: Contrastive Label Disambiguation for Partial Label Learning

Ultra-Data-Efficient GAN Training: Drawing A Lottery Ticket First, Then Training It Toughly

TSDF++: A Multi-Object Formulation for Dynamic Object Tracking and Reconstruction

PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners for self-supervised ViT.

Styleformer - Official Pytorch Implementation

Library extending Jupyter notebooks to integrate with Apache TinkerPop and RDF SPARQL.

Adaptive Pyramid Context Network for Semantic Segmentation (APCNet CVPR'2019)

EMNLP 2020 - Summarizing Text on Any Aspects

Training Structured Neural Networks Through Manifold Identification and Variance Reduction

PyTorch implementation for OCT-GAN Neural ODE-based Conditional Tabular GANs (WWW 2021)

Fast and scalable uncertainty quantification for neural molecular property prediction, accelerated optimization, and guided virtual screening.

Source code and dataset of the paper "Contrastive Adaptive Propagation Graph Neural Networks forEfficient Graph Learning"