Speech Detection 💬
-
Updated
Mar 22, 2022 - CSS
Speech Detection 💬
A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.
This repository contains scripts of activities performed on various deep learning concepts
PocketPiglet for iOS
PocketPiglet for Android
Voice Activity Detection based on Deep Learning & TensorFlow
Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python
Synchronize your subtitles using machine learning
Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
EduSense: Practical Classroom Sensing at Scale
CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
Automagically synchronize subtitles with video.
Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on Whisper OpenAI technology, whisper.cpp.
Add a description, image, and links to the speech-detection topic page so that developers can more easily learn about it.
To associate your repository with the speech-detection topic, visit your repo's landing page and select "manage topics."