Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on Whisper OpenAI technology, whisper.cpp.
-
Updated
May 27, 2024 - C++
Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on Whisper OpenAI technology, whisper.cpp.
Automagically synchronize subtitles with video.
CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
EduSense: Practical Classroom Sensing at Scale
Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
Synchronize your subtitles using machine learning
Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python
Voice Activity Detection based on Deep Learning & TensorFlow
PocketPiglet for Android
PocketPiglet for iOS
This repository contains scripts of activities performed on various deep learning concepts
A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.
Speech Detection 💬
Add a description, image, and links to the speech-detection topic page so that developers can more easily learn about it.
To associate your repository with the speech-detection topic, visit your repo's landing page and select "manage topics."