automatic-speech-recognition

An ASR (Automatic Speech Recognition) system that takes individual recordings as input. Each recording is a sentence of 5-10 spoken English digits separated by adequate pauses. The system segments each sentence using a classifier that distinguishes background from foreground sound.

python classifier automatic-speech-recognition asr openslr mel-spectrogram recognition-algorithms

Updated Sep 12, 2023
Python
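
The segmentation step described above can be illustrated with a minimal sketch. This is not the repository's code: it assumes a simple per-frame energy threshold as a stand-in for the trained background/foreground classifier, and the names `frame_energies`, `segment_foreground`, `threshold`, and `min_pause_frames` are hypothetical.

```python
from typing import List, Sequence, Tuple


def frame_energies(samples: Sequence[float], frame_len: int) -> List[float]:
    """Mean squared amplitude of each non-overlapping frame."""
    return [
        sum(x * x for x in samples[i:i + frame_len]) / frame_len
        for i in range(0, len(samples) - frame_len + 1, frame_len)
    ]


def segment_foreground(
    samples: Sequence[float],
    frame_len: int = 160,
    threshold: float = 0.01,
    min_pause_frames: int = 3,
) -> List[Tuple[int, int]]:
    """Return (start, end) sample indices of foreground runs.

    Assumption: a frame counts as foreground when its energy exceeds
    `threshold`. A real system would replace this rule with a trained
    classifier. A run is closed only after `min_pause_frames`
    consecutive background frames, so short dips do not split a digit.
    """
    labels = [e > threshold for e in frame_energies(samples, frame_len)]
    segments: List[Tuple[int, int]] = []
    start = None   # frame index where the current run began
    silence = 0    # consecutive background frames seen inside a run
    for idx, is_foreground in enumerate(labels):
        if is_foreground:
            if start is None:
                start = idx
            silence = 0
        elif start is not None:
            silence += 1
            if silence >= min_pause_frames:
                end_frame = idx - silence + 1  # first background frame after the run
                segments.append((start * frame_len, end_frame * frame_len))
                start = None
                silence = 0
    if start is not None:
        # The recording ended before a full pause was observed.
        end_frame = len(labels) - silence
        segments.append((start * frame_len, end_frame * frame_len))
    return segments


# Synthetic recording: pause, "digit", pause, "digit", short trailing pause.
recording = [0.0] * 800 + [0.5] * 480 + [0.0] * 800 + [0.5] * 320 + [0.0] * 160
print(segment_foreground(recording))  # [(800, 1280), (2080, 2400)]
```

Each returned span could then be passed to a digit recognizer on its own. The pause length requirement is what makes the "adequate pauses" between digits matter: pauses shorter than `min_pause_frames` frames would merge neighboring digits into one segment.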

SanchezCris / SDR-Automatic-Speech-Recognition

An FM signal capture and voice recognition system to assist individuals with hearing impairments.

python speech-recognition sdr automatic-speech-recognition speech-to-text gnuradio asr software-defined-radio wav2vec2

Updated Apr 17, 2023
Python

Nexdata-AI / 500-Hours-Korean-Conversational-Speech-Data-by-Mobile-Phone

A dataset of Korean conversational speech

audio machine-learning text-to-speech deep-learning dataset wav speech-recognition automatic-speech-recognition speech-to-text speech-processing asr asr-model

Updated Apr 18, 2024

Nexdata-AI / 201-Hours-North-American-English-Speech-Data-by-Mobile-Phone-and-PC

North American English Speech Dataset

audio deep-learning speech tts speech-synthesis dataset speech-recognition automatic-speech-recognition speech-to-text asr asr-benchmark

Updated Apr 19, 2024

Nexdata-AI / 1030-Hours-Shanghai-Dialect-Speech-Data-by-Mobile-Phone

Shanghai Dialect Speech Dataset

audio deep-learning speech tts speech-recognition automatic-speech-recognition speech-to-text asr

Updated Apr 19, 2024

abus-aikorea / studio-free

youtube download, vocal remover, vocal extraction, karaoke video production, STT, automatic speech recognition, transcription, automatic subtitle, AI, yt-dlp, demucs, whisper, webui, gradio, windows

windows ai openai automatic-speech-recognition webui karaoke video-download transcription gradio stt whisper vocal-remover demucs yt-dlp automatic-subtitle

Updated Apr 18, 2024
Python

idiap / TIDIGITSRecipe.jl

A Julia recipe for training an ASR system using the TIDIGITS database

decoding automatic-speech-recognition asr hidden-markov-models wfst

Updated Jun 29, 2021
Julia

YooSungHyun / RNNTransducer

An attempt at implementing a streaming-capable RNN Transducer model with PyTorch Lightning.

online pytorch automatic-speech-recognition korean speech-to-text streaming-audio rnn-transducer pytorch-lightning rnn-t

Updated Dec 20, 2022
Python

therealmolf / audaio

A compilation of libraries, case studies, resources, and research papers revolving around deep learning/machine learning for audio

audio music lists list machine-learning deep-learning neural-network resources music-information-retrieval neural-networks automatic-speech-recognition music-generation audioclassification

Updated Sep 13, 2022

alwaz-shahid / whisper-asr-cli

Automatic Speech Recognition (ASR) / Speech-to-Text (STT) demonstration using the Whisper base model. The Python CLI application transcribes audio to text and works offline.

speech-recognition openai cli-app automatic-speech-recognition speech-to-text stt speech-processing asr-model whisper-ai

Updated Dec 13, 2023
Python

jmaczan / asr-dysarthria

😺 Research on Automatic Speech Recognition for dysarthric speech

deep-learning automatic-speech-recognition asr self-supervised-learning dysarthric-speech wav2vec2 dysarthria

Updated Apr 9, 2024
Jupyter Notebook

Nexdata-AI / 500-Hours-German-Conversational-Speech-Data-by-Mobile-Phone

A dataset of German conversational speech

audio machine-learning deep-learning speech-synthesis dataset voice-recognition wav speech-recognition automatic-speech-recognition speech-to-text asr

Updated Apr 18, 2024

j3soon / whisper-to-input

An Android keyboard that performs speech-to-text (STT/ASR) with OpenAI Whisper and inputs the recognized text. Supports English, Chinese, Japanese, and other languages, including mixed-language speech.

android kotlin keyboard ime voice speech voice-recognition speech-recognition openai virtual-keyboard automatic-speech-recognition speech-to-text whisper android-ime chinese-speech-recognition openai-api

Updated Jan 27, 2024
Kotlin

lexust1 / av2txtsum

Automatic speech recognition (ASR)

machine-learning artificial-intelligence automatic-speech-recognition whisper seamlessm4t

Updated Apr 23, 2024
HTML

automatic-speech-recognition

Here are 287 public repositories matching this topic...

zSeriesGuy / SpeakReader

dangvansam / nvidia-nemo-jasper-quartznet-asr-vietnamese

iammartian0 / Audio101

RobertoAlessandri / DataScienceTask

AnirudhSreeram / Deepspeech-finetune

KyawYeThu-11 / burmese-G2P

BScUniversityCollaborations / automatic-speech-recognition

SanchezCris / SDR-Automatic-Speech-Recognition

Nexdata-AI / 500-Hours-Korean-Conversational-Speech-Data-by-Mobile-Phone

Nexdata-AI / 201-Hours-North-American-English-Speech-Data-by-Mobile-Phone-and-PC

Nexdata-AI / 1030-Hours-Shanghai-Dialect-Speech-Data-by-Mobile-Phone

abus-aikorea / studio-free

idiap / TIDIGITSRecipe.jl

YooSungHyun / RNNTransducer

therealmolf / audaio

alwaz-shahid / whisper-asr-cli

jmaczan / asr-dysarthria

Nexdata-AI / 500-Hours-German-Conversational-Speech-Data-by-Mobile-Phone

j3soon / whisper-to-input

lexust1 / av2txtsum
