#

speech-processing

Here are 565 public repositories matching this topic...

spafe

SuperKogito / spafe

🔉 spafe: Simplified Python Audio Features Extraction

Updated May 29, 2024
Python

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

pytorch pretrained-models speaker-recognition speaker-verification speech-processing speaker-diarization voice-activity-detection speech-activity-detection speaker-change-detection speaker-embedding overlapped-speech-detection

Updated May 28, 2024
Jupyter Notebook

sp-nitech / SPTK

A suite of speech signal processing tools

cpp signal-processing dsp speech lpc unix-command mfcc speech-processing audio-processing lsp sptk cepstrum

Updated May 28, 2024
C++

ddlBoJack / SLAM-LLM

Speech, Language, Audio, Music Processing with Large Language Model

speech-processing audio-processing peft music-processing large-language-model multimodal-large-language-models

Updated May 28, 2024
Python

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Updated May 27, 2024
Python

RuntimeSpeechRecognizer

gtreshchev / RuntimeSpeechRecognizer

Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on Whisper OpenAI technology, whisper.cpp.

voice-recognition speech-recognition openai unreal-engine ue4 speech-to-text whisper speech-processing audio-processing unreal-engine-4 ue4-plugin speech-detection whis ue5 unreal-engine-5 ue5-plugin whisper-cpp whisper-ai

Updated May 27, 2024
C++

xmindflow / Awesome_Mamba

A Comprehensive Survey of Mamba in Deep Learning

natural-language-processing computer-vision deep-learning time-series medical-imaging remote-sensing speech-processing mamba image-enhancement state-space-model gnn large-language-models llm mamba-state-space-models

Updated May 27, 2024

BYO-UPM / MARTA

Source code of the paper "MARTA: a model for the automatic phonemic grouping of the parkinsonian speech"

machine-learning deep-learning speech speech-recognition speech-processing parkinsons-disease gmvae

Updated May 28, 2024
Jupyter Notebook

abikaki / awesome-speech-emotion-recognition

😎 Awesome lists about Speech Emotion Recognition

machine-learning awesome deep-neural-networks deep-learning emotion artificial-intelligence awesome-list human-computer-interaction speech-processing affective-computing sentiment-classification emotion-detection emotion-recognition multimodal-sentiment-analysis speech-emotion-recognition expressive-speech-synthesis multimodal-emotion-recognition emotional-speech speech-emotion-classification

Updated May 23, 2024

weimeng23 / speech-recognition-learning-resources

✅ A list of speech recognition learning resources including courses, books, tutorials, papers and toolkits.

machine-learning deep-learning speech speech-recognition courses speech-to-text speech-processing asr

Updated May 22, 2024

mexca / mexca

Multimodal Emotion eXpression Capture Amsterdam. Pipeline for capturing emotion expressions from multiple modalities (video, audio, text) in the wild.

python docker computer-vision sentiment-analysis pytorch speech-to-text speech-processing emotion-recognition

Updated May 21, 2024
Python

ddlBoJack / Speech-Resources

语音方向实验室/公司/资源/实习等，欢迎推荐或自荐

speech speech-processing

Updated May 21, 2024

Voice-Lab / VoiceLab

Automated Reproducible Acoustical Analysis

python python3 open-science speech-processing voice-analysis acoustic-analysis voice-manipulation

Updated May 27, 2024
Python

Nourine-Nadir / Speech_Processing

This repository explores speech processing techniques like noise cancellation and speech segmentation through Python code.(Speech recognition soon)

artificial-intelligence speech-processing noise-cancellation speech-segmentation

Updated May 20, 2024
Jupyter Notebook

IMS-Toucan

DigitalPhonetics / IMS-Toucan

Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

text-to-speech deep-learning toolkit speech pytorch tts speech-synthesis speech-processing

Updated May 19, 2024
Python

a3ro-dev / voiceTypingApp

Python-based application designed to convert speech to text in real-time.

python script side-project project python3 speech-synthesis speech-recognition speech-to-text speech-processing dad pyttsx3 googlespeech googlespeechapi

Updated May 17, 2024
Python

resemble-ai / resemble-enhance

AI powered speech denoising and enhancement

speech-processing denoise speech-enhancement speech-denoising

Updated May 16, 2024
Python

chimechallenge / chime-utils

Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.

speech-recognition automatic-speech-recognition speech-processing speech-separation speech-enhancement far-field-speech-recognition diarization multi-speaker-asr meeting-transcription

Updated May 16, 2024
Python

vocalpy / vak

A neural network framework for researchers studying acoustic communication

python torch python3 pytorch birdsong speech-processing torchvision bioacoustics animal-communication bioacoustic-analysis vocalizations spectrograms animal-vocalizations

Updated May 11, 2024
Python

itsp

Speech-Interaction-Technology-Aalto-U / itsp

Introduction to Speech Processing

speaker-recognition speech-processing speech-analysis voice-activity-detection speech-enhancement speech-modelling speech-coding speech-quality-evaluation

Updated May 11, 2024
Jupyter Notebook

Improve this page

Add a description, image, and links to the speech-processing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-processing topic, visit your repo's landing page and select "manage topics."