Speech-to-text server framework with next-gen Kaldi
Updated May 27, 2024 - C++
⚡ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supports languages that can be transcribed with characters or subwords
ioBroker Adapter for myUplink.com for Nibe Heat Pumps
A Deep-Learning-Based Chinese Speech Recognition System
PyTorch CTC Decoder bindings
A Flask-based web demo system for Chinese automatic speech recognition, including speech recognition, speech synthesis, and voiceprint-based speaker recognition.
A major ML project in which we built an ASR system for the Nepali language using BiLSTM and ResNet, with a Nepali voice assistant as a work in progress. Supported tasks include playing songs and searching for information on Google
Connectionist Temporal Classification (CTC) decoder with dictionary and language model.
TensorFlow implementations of losses for sequence to sequence machine learning models
Neural network that transcribes speech at the phoneme/character level
Persian Speech Segmentation by CTC-segmentation Method
Neural network trained to recognize short speech commands
[EMNLP'23] Code for "Non-autoregressive Text Editing with Copy-aware Latent Alignments".
CRNN + CTC
Simple Z80 CTC and DART board compatible with the Amstrad CPC and MX4 connector
This repository contains code to build an optical character recognition (OCR) model for recognizing text in captcha images using a Convolutional Recurrent Neural Network (CRNN) architecture with Connectionist Temporal Classification (CTC) loss.
A project dedicated to bringing CPU/on-device model performance close to that of GPU models, with a real-time factor (RTF) below 0.1 on CPU