Vosk api: allow selecting different models and automatic model download #657

rebootl · 2023-01-31T14:58:43Z

Hello,
I added/changed two parameters in the recognize_vosk function, which i believe to be useful.

Firstly, I added a model parameter to allow to select a model based on a model directory. This was previously hard coded to one directory named 'model', making it impossible to easily switch models respectively languages.

Secondly, I noticed that Vosk-api is actually able to download models by itself, based on a given language code. So I implemented this as another parameter 'language'. Previously there was a default language parameter provided in the function, but it was never used.

I implemented it so that the model parameter has precedence over the language parameter, but it defaults to an empty string (False). So that by default the language model is downloaded automatically, because i believe this to be more convenient for the user. However I'm aware that this breaks the previous behavior and may break a user implementation if they want to use a very specific model that they already downloaded (for example one of the larger models). If you think this is a problem i could try to change that.

Also see the updated README in the commit.

Please let me know what you think
Thanks

ps.: if this gets merged i would also update the documentation and maybe write some tests for vosk, which i think are still missing.

pss: also it may be good to actually change to return value of the vosk function to make it more in line with the other functions, since i think this is currently returning a json string, instead of a simple string like the others. (Edit: mentioned here aswell: #592 )

Edit: also now i found this similar pull request #607 adding a model path

recognize_vosk function: - Added function parameter 'model', to select a model based on the model directory. - Also make use of Vosk capability to download the model by itself, by providing language parameters. (the language parameter was previously present but not used) Also see the updated README.

Bae-ChangHyun · 2023-11-03T10:11:04Z

Thank you! I'll use it well

finalRecognition = rec.FinalResult()
finalRecognition = json.loads(finalRecognition)

return finalRecognition.get('text')

Luca-Pozzi · 2024-04-30T13:40:12Z

Hi everybody!
Just out of curiosity, there is any particular reason for not to merge this PR?
It looks like a nice-to-feature for which @rebootl proposed a valid implementation.

I would be happy to work on it if needed!

rebootl mentioned this pull request Mar 7, 2023

ALSA errors rebootl/linux-speech-typer#2

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Vosk api: allow selecting different models and automatic model download #657

Vosk api: allow selecting different models and automatic model download #657

rebootl commented Jan 31, 2023 •

edited

Bae-ChangHyun commented Nov 3, 2023 •

edited

Luca-Pozzi commented Apr 30, 2024

Vosk api: allow selecting different models and automatic model download #657

Are you sure you want to change the base?

Vosk api: allow selecting different models and automatic model download #657

Conversation

rebootl commented Jan 31, 2023 • edited

Bae-ChangHyun commented Nov 3, 2023 • edited

Luca-Pozzi commented Apr 30, 2024

rebootl commented Jan 31, 2023 •

edited

Bae-ChangHyun commented Nov 3, 2023 •

edited