Speech Detection 💬
-
Updated
Mar 22, 2022 - CSS
Speech Detection 💬
This repository contains scripts of activities performed on various deep learning concepts
PocketPiglet for Android
PocketPiglet for iOS
A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.
Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python
EduSense: Practical Classroom Sensing at Scale
Synchronize your subtitles using machine learning
Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on Whisper OpenAI technology, whisper.cpp.
Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
Voice Activity Detection based on Deep Learning & TensorFlow
CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
Automagically synchronize subtitles with video.
Add a description, image, and links to the speech-detection topic page so that developers can more easily learn about it.
To associate your repository with the speech-detection topic, visit your repo's landing page and select "manage topics."