Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Is it possible to detect the spoken language? #1541

Open
silvioprog opened this issue Mar 25, 2024 · 2 comments
Open

Is it possible to detect the spoken language? #1541

silvioprog opened this issue Mar 25, 2024 · 2 comments

Comments

@silvioprog
Copy link

silvioprog commented Mar 25, 2024

Hi.

I have been developing this free transcription website using the model vosk-model-en-us-0.42-gigaspeech, so it should accept only English videos, however, I've noticed some people sending videos in Portuguese, Spanish, Japanese and so on, and I would like to block it.

So, it that possible to detect if the audio (extracted from the video) is really in English language? (Something like whisper.detect_language())

TIA for any help!

@silvioprog silvioprog changed the title Is it possible to identify the audio language? Is it possible to detect the spoken language? Mar 25, 2024
@nshmyrev
Copy link
Collaborator

There is no problem to use whisper for initial language identification step, you can also use other models like

https://huggingface.co/speechbrain/lang-id-voxlingua107-ecapa

@silvioprog
Copy link
Author

@nshmyrev After a couple of tests, I decided to go with speechbrain/lang-id-voxlingua107-ecapa. Thanks a lot for this excellent suggestion!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

No branches or pull requests

2 participants