FreeSWITCH ASR module for working with whisper.cpp (C; updated May 15, 2024)
Multimodal Emotion eXpression Capture Amsterdam. Pipeline for capturing emotion expressions from multiple modalities (video, audio, text) in the wild.
Eden AI: simplify the use and deployment of AI technologies by providing a unique API that connects to the best possible AI engines
This project implements end-to-end real-time speech recognition with a PhoWhisper backend and a React Native frontend.
Breton speech recognizer built with Vosk
Port of OpenAI's Whisper model in C/C++
Easy-to-use speech toolset. Written in TypeScript. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.
Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime, without an Internet connection. Supports embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift
A web UI project for learning about large language models. It includes features such as chat, quantization, fine-tuning, prompt engineering templates, and multimodality.
VocalTexter is a simple, user-friendly web application that converts spoken words into written text and text into voice. It features easy-to-use recording controls, a convenient copy button for quick text sharing, and simple text-to-voice conversion.
Short code for dictation using OpenAI Whisper for transcription.
Open Voice OS Status Page
Dockerized Whisper C++ speech-to-text API for easy deployment and rapid integration. Offering the latest stable and nightly builds for efficient audio transcription.
General-purpose AI assistant with voice IO support and custom system prompts powered by OpenAI.
A set of guides to get you doing great things with IDOL Media Server!
Android web novel reader
Achieve your goals and keep your data private with Lotti. This life tracking app is designed to help you stay motivated and on track, all while keeping your personal information safe and secure. Now with on-device speech recognition.
Go SDK for Deepgram's automated speech recognition APIs.
Jarvis, a personal desktop assistant built using Python.