Skip to content

Query Regarding MP3 Format Support and Accuracy in Silero-VAD #395

Answered by snakers4
Jellun asked this question in Q&A
Discussion options

You must be logged in to vote

read_audio("audio_file_path", sampling_rate=SAMPLING_RATE) use FFMPEG or some other Python modules to load the file? Does it automatically detect the audio file format and convert it to wav before further processing it within the model?

I believe it uses torchaudio, which has a sox_io backend now by default, which uses sox. Not sure about MP3 support in sox, it is always flaky with MP3.

Replies: 1 comment 2 replies

Comment options

You must be logged in to vote
2 replies
@Jellun
Comment options

@snakers4
Comment options

Answer selected by Jellun
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants