vad

Here are 86 public repositories matching this topic...

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

pytorch speech-recognition vad punctuation whisper audio-visual-speech-recognition speaker-diarization voice-activity-detection conformer pretrained-model rnnt dfsmn paraformer speechgpt speechllm

Updated Jun 4, 2024
Python

smacke / ffsubsync

Sponsor

Star

Automagically synchronize subtitles with video.

Updated Mar 18, 2024
Python

jtkim-kaist / VAD

Star

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

data speech dnn lstm speech-recognition attention vad voice-detection voice-activity-detection bdnn acam speech-activity-detection

Updated Jun 9, 2021
MATLAB

Baidu-AIP / speech-vad-demo

Star

集成Webrtc的VAD，用于切分音频文件

webrtc speech vad webrtc-vad

Updated Aug 26, 2020
C

amsehili / auditok

Star

An audio/acoustic activity detection and audio segmentation tool

vad audio-data audio-activities audio-segmentation voice-detection voice-activity-detection

Updated Mar 30, 2023
Python

filippogiruzzi / voice_activity_detection

Star

Voice Activity Detection based on Deep Learning & TensorFlow

python machine-learning deep-neural-networks deep-learning time-series tensorflow speech artificial-intelligence speech-recognition vad resnet deeplearning time-series-classification voice-activity-detection librispeech speech-detection librispeech-dataset mfcc-features

Updated Mar 24, 2023
Python

CheshireCC / faster-whisper-GUI

Star

faster_whisper GUI with PySide6

openai vad whisper asr transcribe voice-transcription faster-whisper whisperx

Updated Jun 3, 2024
Python

xiongyihui / python-webrtc-audio-processing

Star

Python bindings of WebRTC Audio Processing

python vad ns agc webrtc-audio-processing

Updated Jan 22, 2019
C++

xia-chu / webrtc_apm

Star

webrtc中apm相关代码的提取，包括AEC/NS/AGC/VAD ，另外还包括mp3/aac编码器、SoundTouch

webrtc mp3 aac jni vad ns agc soundtouch aec

Updated Jun 30, 2023
C

gkonovalov / android-vad

Star

Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.

Updated Feb 12, 2024
C

eesungkim / Voice_Activity_Detector

Star

A statistical model-based Voice Activity Detection

vad voice-detection voice-activity-detection

Updated Nov 30, 2018
Jupyter Notebook

fjchange / object_centric_VAD

Star

An Tensorflow Re-Implement of CVPR 2019 "Object-centric Auto-Encoders and Dummy Anomalies for Abnormal Event Detection in Video"

vad anomaly cvpr2019

Updated May 6, 2022
Python

shanghaimoon888 / mod_vadasr

Star

This is FreeSwitch module that can do VAD and ASR with IFLYTEK websocket api.

freeswitch vad asr freeswitch-esl freeswitch-plugin

Updated Jul 1, 2022
C

voithru / voice-activity-detection

Star

Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021

vad voice-activity-detection

Updated Oct 26, 2021
Python

0vercl0k / sic

Sponsor

Star

Enumerate user mode shared memory mappings on Windows.

driver windows-10 windows-kernel vad shm shared-memory ntoskrnl prototype-pte

Updated Feb 14, 2021
C

mounalab / LSTM-RNN-VAD

Star

Voice Activity Detection LSTM-RNN learning model

tensorflow lstm rnn vad rnn-tensorflow nlp-machine-learning lstm-neural-network

Updated Apr 17, 2018
Python

EtienneAb3d / WhisperHallu

Star

Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts

text-to-speech sound-processing vad whisper audio-processing asr noise-removal vocals

Updated Feb 6, 2024
Python

shashikg / WhisperS2T

Star

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine

deep-learning speech-recognition vad speech-to-text whisper asr tensorrt voice-activity-detection tensorrt-llm

Updated Apr 5, 2024
Jupyter Notebook

NickWilkinson37 / voxseg

Star

A python library for voice activity detection (VAD) for speech/non-speech segmentation.

python python-library speech vad speech-processing voice-activity-detection speech-segmentation

Updated Sep 7, 2022
Python

DmitryRyumin / ICASSP-2023-24-Papers

Star

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!

Updated Jun 4, 2024
Python

Improve this page

Add a description, image, and links to the vad topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the vad topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

vad

Here are 86 public repositories matching this topic...

modelscope / FunASR

smacke / ffsubsync

jtkim-kaist / VAD

Baidu-AIP / speech-vad-demo

amsehili / auditok

filippogiruzzi / voice_activity_detection

CheshireCC / faster-whisper-GUI

xiongyihui / python-webrtc-audio-processing

xia-chu / webrtc_apm

gkonovalov / android-vad

eesungkim / Voice_Activity_Detector

fjchange / object_centric_VAD

shanghaimoon888 / mod_vadasr

voithru / voice-activity-detection

0vercl0k / sic

mounalab / LSTM-RNN-VAD

EtienneAb3d / WhisperHallu

shashikg / WhisperS2T

NickWilkinson37 / voxseg

DmitryRyumin / ICASSP-2023-24-Papers

Improve this page

Add this topic to your repo