😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
-
Updated
Nov 27, 2023 - Python
😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Chinese Mandarin tts text-to-speech 中文 (普通话) 语音 合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder, with biaobei and aishell3 datasets
PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS, FastSpeech2/FastPitch, SpeedySpeech, WaveFlow and Parallel WaveGAN)
PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech
AdaSpeech: Adaptive Text to Speech for Custom Voice
A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS
基于标贝数据继续训练,同时对原本的FastSpeech2模型做了改进,引入了韵律表征以及韵律预测模块,使中文发音更生动且富有节奏
An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice"
Desktop application for neural speech synthesis written in C++
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS
Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech ✊
The Implementation of FastSpeech2 Based on Pytorch.
Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.
LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech
This is the experimental description of MnTTS2.
Refactored version of https://github.com/ming024/FastSpeech2
This repository contain the code of the main part of my master thesis degree at Politecnico di Torino in Data science & Engineering
Add a description, image, and links to the fastspeech2 topic page so that developers can more easily learn about it.
To associate your repository with the fastspeech2 topic, visit your repo's landing page and select "manage topics."