End-to-End Speech Processing Toolkit
-
Updated
May 29, 2024 - Python
End-to-End Speech Processing Toolkit
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
A PyTorch-based Speech Toolkit
A python package to build AI-powered real-time audio applications
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Speaker Verification using Pytorch
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
Speaker Diarization, Recognition and Language Identification. Scripts to generate GT using our WebApp and Praat software
Speaker diarization service
turnkey self-hosted offline transcription and diarization service with llm summary
Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
On-device speaker diarization powered by deep learning
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Full-stack Transcription-UI: Features OpenAI Whisper and NVIDIA NeMo, with Docker for easy deployment.
Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.
Subtitle generation w/ Speaker Diarization using Whisper and pyannote.audio
Add a description, image, and links to the speaker-diarization topic page so that developers can more easily learn about it.
To associate your repository with the speaker-diarization topic, visit your repo's landing page and select "manage topics."