Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in Persian language.
-
Updated
Jun 3, 2024 - Python
Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in Persian language.
Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, TTS. Open Source, Local & Free.
TTS models for Arabic (Tacotron2, FastPitch)
an improved version of Real-time-voice-cloning
Synthese vocale avec conditionnement sur tres petit jeu de données. Utilisation des modeles Tacotron2 et WaveGlow de Nvidia avec Pytorch.
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Converting text to audio and applying audio augmentation
Code used in conjunction with an implementation of a Seq2Seq LSTM TTS frontend, to process and evaluate Google Research's Wikipedia Homograph Dataset (WHD) and LibriSpeech data, with the aim of improving the TTS frontend's homograph disambiguation abilities.
PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.
EC499: Major Project
TTS for pitch-accented language. Korean dialect DB.
This repository contain the code of the main part of my master thesis degree at Politecnico di Torino in Data science & Engineering
Add a description, image, and links to the tacotron2 topic page so that developers can more easily learn about it.
To associate your repository with the tacotron2 topic, visit your repo's landing page and select "manage topics."