跨平台、多模态、高度自定义的骰系开发框架 | “如何更好的为冷门规则书做适配”?| “如何更好的实现人机交互?”
-
Updated
May 28, 2024 - Python
跨平台、多模态、高度自定义的骰系开发框架 | “如何更好的为冷门规则书做适配”?| “如何更好的实现人机交互?”
Analyze an audio file and count words, sentences and timestamps, filler words
Voice assistant using Multimodal LLMs - LLaVA-NeXT (Mistral 7B) finetuned & PhoWhisper
Add a description, image, and links to the audio-speech-recognition topic page so that developers can more easily learn about it.
To associate your repository with the audio-speech-recognition topic, visit your repo's landing page and select "manage topics."