🔉 spafe: Simplified Python Audio Features Extraction
-
Updated
May 29, 2024 - Python
🔉 spafe: Simplified Python Audio Features Extraction
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
A suite of speech signal processing tools
Speech, Language, Audio, Music Processing with Large Language Model
A PyTorch-based Speech Toolkit
Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on Whisper OpenAI technology, whisper.cpp.
A Comprehensive Survey of Mamba in Deep Learning
Source code of the paper "MARTA: a model for the automatic phonemic grouping of the parkinsonian speech"
😎 Awesome lists about Speech Emotion Recognition
✅ A list of speech recognition learning resources including courses, books, tutorials, papers and toolkits.
Multimodal Emotion eXpression Capture Amsterdam. Pipeline for capturing emotion expressions from multiple modalities (video, audio, text) in the wild.
Automated Reproducible Acoustical Analysis
This repository explores speech processing techniques like noise cancellation and speech segmentation through Python code.(Speech recognition soon)
Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
Python-based application designed to convert speech to text in real-time.
AI powered speech denoising and enhancement
Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.
A neural network framework for researchers studying acoustic communication
Introduction to Speech Processing
Add a description, image, and links to the speech-processing topic page so that developers can more easily learn about it.
To associate your repository with the speech-processing topic, visit your repo's landing page and select "manage topics."