Skip to content

❓ Can window_size_samples be selected as 160 (10ms)? #443

Answered by snakers4
jifashen asked this question in Q&A
Discussion options

You must be logged in to vote

Can window_size_samples be selected as 160 (10ms)?

No, because we do not have data annotation with such granularity.

what do the parameters threshold, min_silence_samples_at_max_speech, min_speech_samples, max_speech_samples, speech_pad_samples mean

threshold - it is the main activation parameter, probability threshold when the VAD activates
min_speech_samples - the minimal number of audio samples that can constitute speech, this is to suppress spurious activations
max_speech_samples - the same but for maximum speech length

min_silence_samples_at_max_speech, speech_pad_samples - these are a but technical, it is better to look them up in the utils code

and what impact do they have on …

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by snakers4
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
help wanted Extra attention is needed
2 participants
Converted from issue

This discussion was converted from issue #442 on April 10, 2024 08:23.