Word Level Transcripts Error #789

Open
Harinderpreet opened this issue Apr 22, 2024 · 8 comments

Comments

@Harinderpreet

I have noticed that WhisperX does not produce correct word-level transcripts for numbers. It garbles them, and it also garbles the word-level output for the words immediately before and after each number.
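
One way to confirm the symptom is to scan the aligned output for words that came back without timing. The sketch below assumes each aligned segment carries a `words` list of dicts, and that a word that could not be aligned simply lacks the `start` and `end` keys. The helper name `words_missing_timestamps` and the sample data are hypothetical and hand-written for illustration; they are not real WhisperX output.

```python
def words_missing_timestamps(segments):
    """Return the text of every word that has no 'start' or 'end' key."""
    missing = []
    for segment in segments:
        # Segments without a 'words' list are skipped rather than treated as errors.
        for word in segment.get("words", []):
            if "start" not in word or "end" not in word:
                missing.append(word.get("word", ""))
    return missing


# Hand-written sample in the assumed shape (not real WhisperX output).
sample_segments = [
    {
        "text": "I have 7 apples",
        "words": [
            {"word": "I", "start": 0.10, "end": 0.20},
            {"word": "have", "start": 0.25, "end": 0.50},
            {"word": "7"},  # no timing: the digit could not be aligned
            {"word": "apples", "start": 0.90, "end": 1.40},
        ],
    }
]

print(words_missing_timestamps(sample_segments))
```

If digits show up in that list while ordinary words do not, the problem is most likely the one described in this issue.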

@LandryBulls

Also experiencing the same issue.

@SeeknnDestroy

SeeknnDestroy commented May 15, 2024

You can set suppress_numerals=True in asr_options. The model will then output numbers as spelled-out words, such as "seven", and you'll be able to use word-level timestamps.
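
A minimal sketch of that suggestion using the Python API, following the usage pattern shown in the WhisperX README. The option is spelled `suppress_numerals` in the WhisperX source. The function name `transcribe_with_word_timestamps`, the model size, `compute_type`, and `batch_size` are illustrative assumptions, not recommendations from this thread.

```python
# Ask the ASR model to spell numbers out ("seven" instead of "7") so the
# alignment step has letters it can attach timestamps to.
ASR_OPTIONS = {"suppress_numerals": True}


def transcribe_with_word_timestamps(audio_path, device="cpu", model_name="small"):
    """Transcribe and align one audio file; returns the aligned segments."""
    import whisperx  # imported here so the sketch loads even without WhisperX installed

    # model_name, compute_type and batch_size are placeholder choices.
    model = whisperx.load_model(
        model_name, device, compute_type="int8", asr_options=ASR_OPTIONS
    )
    audio = whisperx.load_audio(audio_path)
    result = model.transcribe(audio, batch_size=8)

    # Word-level timing comes from a separate alignment model.
    align_model, metadata = whisperx.load_align_model(
        language_code=result["language"], device=device
    )
    aligned = whisperx.align(result["segments"], align_model, metadata, audio, device)
    return aligned["segments"]
```

Calling `transcribe_with_word_timestamps("audio.wav")` with a real file path should return segments whose `words` entries carry timing for the spelled-out numbers, assuming the alignment model for the detected language is available.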

@Godfreyid

@SeeknnDestroy, where can I read the documentation for this? There are a lot of commands, each with many options, too many to try one by one to learn what they do. How did you figure out the 'suppress_numerals=True' behavior? Do you just tinker?

@SeeknnDestroy

I had the same issue before, and I searched the issues to see whether anyone else had run into it: #314 (comment)

@SeeknnDestroy

And no, I don't know whether there is any documentation. I just tinker, read the source code, search the issues, and so on.

@Godfreyid

Okay, thanks though.

@Godfreyid

I just went back and ran this command in the Anaconda terminal: $ whisperx --help
I saw the '--suppress_numerals' option there with a brief description. So that is likely the place to look for documentation.

@SeeknnDestroy

I am also using this as documentation, if it counts: https://github.com/m-bain/whisperX/blob/main/whisperx/transcribe.py#L20-L78 @GodfreyIbrahim
