Is it possible to detect the spoken language? #1541

silvioprog · 2024-03-25T00:41:56Z

Hi.

I have been developing a free transcription website using the model vosk-model-en-us-0.42-gigaspeech, so it should accept only English videos. However, I've noticed some people sending videos in Portuguese, Spanish, Japanese and so on, and I would like to block them.

So, is it possible to detect whether the audio (extracted from the video) is really in English? (Something like whisper.detect_language())

TIA for any help!

nshmyrev · 2024-03-25T20:54:57Z

There is no problem with using whisper for an initial language identification step. You can also use other models, such as

https://huggingface.co/speechbrain/lang-id-voxlingua107-ecapa
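For the Whisper route, a minimal sketch might look like the following. It assumes the `openai-whisper` package is installed and follows the language detection pattern from its README. The helper names `pick_language` and `detect_language_whisper`, and the choice of the `base` model, are illustrative assumptions rather than anything prescribed in this thread.

```python
def pick_language(probs):
    """Return the (language_code, probability) pair with the highest probability."""
    code = max(probs, key=probs.get)
    return code, probs[code]


def detect_language_whisper(path, model_name="base"):
    """Guess the spoken language of an audio file with Whisper.

    Hypothetical helper: only the first 30 seconds of audio are inspected,
    because pad_or_trim cuts (or pads) the signal to Whisper's window size.
    """
    import whisper  # imported lazily; requires `pip install openai-whisper`

    model = whisper.load_model(model_name)
    audio = whisper.pad_or_trim(whisper.load_audio(path))
    mel = whisper.log_mel_spectrogram(audio, n_mels=model.dims.n_mels).to(model.device)
    # detect_language returns the language tokens and a dict of code -> probability
    _, probs = model.detect_language(mel)
    return pick_language(probs)
```

Calling `detect_language_whisper("audio.wav")` would return a pair such as a language code and its probability, which can then be compared against `"en"` before the audio is passed to Vosk.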

silvioprog · 2024-04-01T00:43:00Z

@nshmyrev After a couple of tests, I decided to go with speechbrain/lang-id-voxlingua107-ecapa. Thanks a lot for this excellent suggestion!
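For anyone landing here later, a rough sketch of an English-only gate built on that model could look like this. It assumes `speechbrain` is installed and follows the usage shown on the model card. The function names, the `savedir` path and the `min_prob` threshold are hypothetical, and the label handling assumes the classifier returns either a bare code such as `en` or a string such as `en: English`, depending on the model version.

```python
import math


def is_english(label, log_score, min_prob=0.5):
    """Decide whether a prediction counts as English.

    `label` is the predicted language label and `log_score` is the
    log-likelihood of that prediction. `min_prob` is an arbitrary
    illustrative threshold, not a recommended value.
    """
    code = label.split(":")[0].strip().lower()
    return code == "en" and math.exp(log_score) >= min_prob


def classify_language(path, savedir="pretrained_lang_id"):
    """Return (label, log_score) for an audio file (hypothetical helper)."""
    try:
        from speechbrain.inference.classifiers import EncoderClassifier
    except ImportError:
        # Older speechbrain releases expose the class under `pretrained`
        from speechbrain.pretrained import EncoderClassifier

    classifier = EncoderClassifier.from_hparams(
        source="speechbrain/lang-id-voxlingua107-ecapa", savedir=savedir
    )
    signal = classifier.load_audio(path)
    # classify_batch returns (log-probabilities, best score, best index, labels)
    _, score, _, labels = classifier.classify_batch(signal)
    return labels[0], score[0].item()
```

A submission could then be rejected when `is_english(*classify_language("audio.wav"))` is false. The threshold is worth tuning on real uploads, since short or noisy clips may get low scores even when they are in English.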

silvioprog changed the title ~~Is it possible to identify the audio language?~~ Is it possible to detect the spoken language? Mar 25, 2024
