Streaming Speech-to-Text Web Server. Share transcript to many in realtime.
-
Updated
Dec 26, 2022 - Python
Streaming Speech-to-Text Web Server. Share transcript to many in realtime.
Nhận dạng giọng nói Tiếng Việt sử dụng model Quartznet (Nvidia) + flask demo
Hugging Face Audio coursework
Fine-tuning code for making deepspeech robust to adversarial attacks.
Myanmar (Burmese) Language Grapheme to Phoneme Converter
Created an ASR (Automatic Speech Recognition) system that takes in individual recordings. Each recording represents a sentence composed of 5-10 English language digits, separated by adequate pauses. The system involves segmenting the sentence using a classifier, differentiating between background and foreground sounds.
FM signal capturing system and voice recognition for the assistance of individuals with hearing impairments.
The dataset of Korean conversational speech
North American English Speech Dataset
Shanghai Dialect Speech Dataset
youtube download, vocal remover, vocal extraction, karaoke video production, STT, automatic speech recognition, transcription, automatic subtitle, AI, yt-dlp, demucs, whisper, webui, gradio, windows
A Julia recipe for training an ASR system using the TIDIGITS database
Streaming 가능한 RNN Transducer 모델을 PyTorch Lightning으로 구현해본다.
A compilation of libraries, case studies, resources, and research papers revolving around deep learning/machine learning for audio
Automatic Speech Recognition ASR / Speech To Text STT demonstration using Whisper/base model. The cli python application transcribe an audio to text, works offline.
😺 Research on Automatic Speech Recognition for dysarthric speech
The dataset of German conversational speech
An Android keyboard that performs speech-to-text (STT/ASR) with OpenAI Whisper and input the recognized text; Supports English, Chinese, Japanese, etc. and even mixed languages.
Automatic speech recognition (ASR)
Add a description, image, and links to the automatic-speech-recognition topic page so that developers can more easily learn about it.
To associate your repository with the automatic-speech-recognition topic, visit your repo's landing page and select "manage topics."