Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation
Updated Apr 12, 2024 - HTML
Various projects utilizing diverse generative AI techniques to produce audio, code, images, text, and Streamlit applications.
Denoising Diffusion Implicit Models
Source code for "Visually aligned sound generation via sound-producing motion parsing" (Published at Neurocomputing)
A mel spectrogram generator using conditional WGAN-GP. For the mel spectrogram inverter, see HiFi-GAN.
Turn your words into music! Describe a sound (e.g., happy, spooky) and this app generates a short piece based on your text.
MIDI generator for chord progressions.
Site for sharing Bark voices
The service is used to query text-to-audio AI models from the Hugging Face inference API.
This repository is a comprehensive guide and toolkit for music generation, featuring diverse algorithms, deep learning models, and creative techniques to inspire and assist in the composition of unique musical pieces.
AI audio processing methods.
Code implementation for the paper "Relating Human Perception of Musicality to Prediction in a Predictive Coding Model"
BeeBrain is your personal chatbot. Use tools, generate images, run code and so much more!
Experiments in neural networks for audio generation.
ConsistencyTTA: Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation
Image Captioning and Text-to-Speech
Knowledge Distillation of different DDSP Decoders for audio signal generation
Text to audio (voice, music), with ChatGPT support
An AI-based all-in-one video production service, covering everything from script writing to dubbing and image generation