Is this repo usable for a production use case!! #158

utility-aagrawal · 2024-01-23T16:59:24Z

Hi All,

I am wondering if anyone has used this repo for a production use case. Currently, I am using OpenAI Whisper for transcription but want to add speaker diarization. I have tried pyannote in the past, but the results from this repo look much better. My concern is that the source code wasn't written with production use in mind: it is not very flexible, prints too many log messages, etc. I could rewrite the code myself, but what happens when the repo is updated in the future? I would appreciate the community's input on this. Thanks!

utility-aagrawal · 2024-01-23T16:59:58Z

@MahmoudAshraf97, I would appreciate your take on this! Thanks for sharing your work!

MahmoudAshraf97 · 2024-01-24T13:01:19Z

Hello, and thanks for the input. Please open a PR with any changes you think are useful and we can discuss them together.

utility-aagrawal · 2024-01-25T20:46:24Z

@MahmoudAshraf97, thanks for your understanding! This is what I want to do:

1. Leave existing functionality as-is.
2. Please see the attached .txt file. Currently, a lot of messages, warnings, and logs are printed to the command line. I want to make this optional so users can choose whether to see them.
   whisper_diarization_stdout.txt
3. If users want, they should be able to run the whole pipeline locally, meaning they can download all the models into a directory beforehand. Faster-whisper and whisperX's load_align_model already support this. I can check whether the other models can be used this way too. Do you know if this is feasible? What other models are used in this pipeline? I still have to go through the code and don't have the answer yet.
4. Format the code for readability and usability.
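
To make the optional-messages idea above concrete, here is a minimal sketch using only the Python standard library. The `--quiet` flag, the `whisper_diarization` logger name, and both function names are assumptions made for illustration; none of them come from the repo's actual code. Output stays on by default so existing behaviour is unchanged.

```python
import argparse
import logging
import warnings


def configure_output(quiet: bool) -> logging.Logger:
    """Return a logger that hides informational messages when quiet is True."""
    # Hypothetical logger name, chosen only for this sketch.
    logger = logging.getLogger("whisper_diarization")
    # Errors are always shown; info and warning messages only when not quiet.
    logger.setLevel(logging.ERROR if quiet else logging.INFO)
    if not logger.handlers:
        logger.addHandler(logging.StreamHandler())
    if quiet:
        # Also hide Python warnings raised by dependencies.
        warnings.simplefilter("ignore")
    return logger


def parse_args(argv=None) -> argparse.Namespace:
    """Parse the command line; --quiet is a hypothetical flag."""
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--quiet",
        action="store_true",
        help="hide progress messages and warnings (errors are still shown)",
    )
    return parser.parse_args(argv)
```

A script would call `configure_output(parse_args().quiet)` once at startup and then use `logger.info(...)` in place of bare `print(...)` calls. Messages emitted by third-party libraries through their own loggers would need to be silenced separately.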

Let me know what you think. It will take some time to make all these changes. Before I spend any time, I wanted to align with you. Thanks!

utility-aagrawal · 2024-02-02T20:49:52Z

@MahmoudAshraf97, do you have any feedback?

utility-aagrawal · 2024-02-28T16:03:39Z

@MahmoudAshraf97, thoughts?

aedocw · 2024-05-08T19:44:27Z

I'm not speaking for @MahmoudAshraf97 here, but if you take a look at his response from Jan 24, it's pretty clear. This is an open source project that he's doing for whatever his reasons are. @utility-aagrawal, you are treating it like a commercial product that you are paying for.

If you want these changes, you are free to implement them and submit PRs to get them merged into the project. If you are not a developer, you could pay someone to do the work and submit the patches.

transcriptionstream · 2024-05-08T23:42:06Z

I have this running in a production environment. It’s stable, consistent, and does a great job.
