Port of OpenAI's Whisper model in C/C++
-
Updated
May 15, 2024 - C
Port of OpenAI's Whisper model in C/C++
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
🧠 Leon is your open-source personal assistant.
kaldi-asr/kaldi is the official location of the Kaldi project.
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Faster Whisper transcription with CTranslate2
Speech recognition module for Python, supporting several engines and APIs, online and offline.
A PyTorch-based Speech Toolkit
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
💬 Speech recognition for your site
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,并添加配音
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
Lingvo
Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Gathers machine learning and Tensorflow deep learning models for NLP problems, 1.13 < Tensorflow < 2.0
Add a description, image, and links to the speech-to-text topic page so that developers can more easily learn about it.
To associate your repository with the speech-to-text topic, visit your repo's landing page and select "manage topics."