text-to-audio

Here are 36 public repositories matching this topic...

YingqingHe / Awesome-LLMs-meet-Multimodal-Generation

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).

text-to-speech multimodality text-to-image text-to-audio text-to-video text-to-music multimodal-models aigc large-language-models text-to-3d multimodal-generation text-to-sound large-vision-language-models multimodal-large-language-models

Updated May 27, 2024
HTML

open-mmlab / Amphion

Star

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

text-to-speech audit speech-synthesis audio-synthesis music-generation voice-conversion text-to-audio fastspeech2 vits hifi-gan audio-generation singing-voice-conversion vall-e audioldm naturalspeech2

Updated May 24, 2024
Python

Text-to-Audio / Make-An-Audio

Star

PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model

latent-space video-to-audio diffusion-models text-to-audio latent-diffusion

Updated May 22, 2024
Python

gitmylo / audio-webui

Sponsor

Star

A webui for different audio related Neural Networks

music text-to-speech ai generative-audio aio artificial-intelligence tts bark rvc all-in-one generative-music voice-cloning text-to-audio audioldm audiocraft bark-gui rvc-gui

Updated May 18, 2024
Python

RhythrosaLabs / soundstorm

Star

Soundstorm is a cutting-edge AI-powered audio manipulation application designed to provide a rich yet simplified experience for sound designers, algorithmic composers, and experimental audio enthusiasts. From sample pack creation and algorithmic composition to AI text-to-audio and onscreen ChatGPT, Soundstorm is a sonic powerhouse.

midi chatbot sound sound-processing gpt algorithmic-music algorithmic-composition sounds audio-processing random-music audio-tools sound-design text-to-audio audio-toolbox ai-audio gpt-4 chatgpt chat-gpt ai-audio-generation

Updated May 4, 2024
Python

declare-lab / tango

Star

A family of diffusion models for text-to-audio generation.

language-models diffusion diffusion-models text-to-audio audio-generation large-language-models

Updated May 2, 2024
Python

Kartiksood10 / Text-to-Music-Generation-App

Star

Generate Music using natural language prompts using Meta's MusicGen Small Model.

python text-to-audio streamlit musicgen audiocraft

Updated Apr 18, 2024
Python

Consistency-TTA / consistency-tta.github.io

Star

Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation

audio diffusion-models text-to-audio audio-generation audio-diffusion

Updated Apr 12, 2024
HTML

Ate329 / SentiMusic

Star

A text-to-audio application that turns words and sentiments into melodies.

python music sentiment-analysis tensorflow music-composition sentiment transformers pytorch twitter-sentiment-analysis phi2 text-to-audio huggingface-transformers musicgen audiocraft twitter-roberta-base-sentiment

Updated Apr 4, 2024
Python

AMAAI-Lab / mustango

Star

Mustango: Toward Controllable Text-to-Music Generation

diffusion-models text-to-audio text-to-music large-language-models

Updated Mar 26, 2024
Python

happylittlecat2333 / Auffusion

Star

Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"

diffusion diffusion-models text-to-audio audio-generation large-language-models

Updated Mar 25, 2024
Jupyter Notebook

inferless / bark

Star

Bark is a transformer-based text-to-audio model created by Suno. Bark can generate highly realistic, multilingual speech as well as other audio - including music, background noise and simple sound effects. The model can also produce nonverbal communications like laughing, sighing and crying.

text-to-audio

Updated Feb 28, 2024
Python

vishalnagda1 / text-to-speech

Star

Python program to convert text to speech.

text-to-speech text-to-speech-python3 text-to-audio text-to-speech-app convert-text-to-audio

Updated Feb 27, 2024
Python

PapayaResearch / ctag

Star

Creative Text-to-Audio Generation via Synthesizer Programming @ NeurIPS'23 ML4Audio Workshop

machine-learning synthesizer jax text-to-audio generative-ai

Updated Feb 12, 2024
Python

open-v2ai / podcast-ai

Star

Whether it’s text or a link, it can be turned into a podcast!

text-to-speech podcast openai text-to-audio audio-ai

Updated Feb 6, 2024
TypeScript

ahsplore / TalkitOut-TTS-web-application-python

Star

TalkItOut is a Python and Flask-based web application that can convert text to speech, choose your preferred language for audio output, access a built-in dictionary for word meanings, and even extract text from images, complete with audio generation.