Hi, thank you for the proposal! It looks nice, but what is the use case, please? I can't imagine why a user would need to start from a certain offset instead of just processing the whole file.
You have a recording of an interview and a list of the start times of every question and answer. You may want to assign the transcribed parts to their respective time points (question and answer).
You have a music radio program in which the presenter comments after every two or three songs. You may want to transcribe only the presenter, not the songs.
And last but not least: you have an audio file in which different speakers speak different languages. You may want to transcribe each part of the audio with the language and model that match it.
Currently the transcriber processes the whole input file, from beginning to end.
It would be very useful to be able to pass a start time offset and/or a duration to the transcriber.
Here is a proposal for how to do it:
Add (ffmpeg's) arguments `time_off` and `duration` in `python/vosk/transcriber/cli.py` after line 46.
Pass the arguments `time_off` and `duration` to ffmpeg in function `resample_ffmpeg` in `python/vosk/transcriber/transcriber.py` (line 115).
The function `resample_ffmpeg_async` could be adapted similarly.
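To make the proposal more concrete, here is a minimal sketch of the two pieces, not the actual code from `cli.py` or `transcriber.py`. The helper names `build_parser` and `build_ffmpeg_cmd`, the option spellings `--time-off` and `--duration`, and the 16 kHz sample rate are assumptions for illustration only. The ffmpeg options `-ss` (seek position) and `-t` (duration) are real, and placing them before `-i` applies them to the input.

```python
import argparse

SAMPLE_RATE = 16000  # assumed target rate for this sketch


def build_parser():
    """Hypothetical CLI parser exposing the two proposed options."""
    parser = argparse.ArgumentParser(description="Transcribe part of an audio file")
    parser.add_argument("--input", required=True, help="audio file to transcribe")
    parser.add_argument("--time-off", dest="time_off", type=float, default=None,
                        help="start offset in seconds (forwarded to ffmpeg -ss)")
    parser.add_argument("--duration", type=float, default=None,
                        help="length in seconds to process (forwarded to ffmpeg -t)")
    return parser


def build_ffmpeg_cmd(infile, time_off=None, duration=None, sample_rate=SAMPLE_RATE):
    """Build an ffmpeg command that writes mono 16-bit PCM to stdout.

    -ss and -t are only added when the caller supplied them, so the
    default behaviour (process the whole file) is unchanged.
    """
    cmd = ["ffmpeg", "-nostdin", "-loglevel", "quiet"]
    if time_off is not None:
        cmd += ["-ss", str(time_off)]   # input option: seek before decoding
    if duration is not None:
        cmd += ["-t", str(duration)]    # input option: limit how much is read
    cmd += ["-i", str(infile),
            "-ar", str(sample_rate), "-ac", "1", "-f", "s16le", "-"]
    return cmd


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(build_ffmpeg_cmd(args.input, args.time_off, args.duration))
```

The returned list can be handed to `subprocess.Popen(cmd, stdout=subprocess.PIPE)` in the same way as a command without the extra options. Building the command as a list rather than a formatted string also avoids quoting problems with unusual file names.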