kaldi-asr/kaldi is the official location of the Kaldi project.
Updated Apr 30, 2024 - Shell
DeepSpeech is an open-source embedded (offline, on-device) speech-to-text engine that can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers.
Port of OpenAI's Whisper model in C/C++
Speech recognition module for Python, supporting several engines and APIs, online and offline.
A Deep-Learning-Based Chinese Speech Recognition System
A PyTorch-based Speech Toolkit
🧠 Leon is your open-source personal assistant.
💬 Speech recognition for your site
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Faster Whisper transcription with CTranslate2
A collection of machine learning and TensorFlow deep learning models for NLP problems, 1.13 < TensorFlow < 2.0
Translates video from one language to another and adds dubbing.
🎙 Speech recognition using the TensorFlow deep learning framework and sequence-to-sequence neural networks
Lingvo
A voice control, voice command, speech recognition, and speech synthesis JavaScript library. Create your own Siri, Google Now, or Cortana with Google Chrome within your website.
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.