Parallel request to the server #188

Open
alpcansoydas opened this issue Mar 21, 2024 · 1 comment

Comments


alpcansoydas commented Mar 21, 2024

For example, suppose the server side is deployed. Can it handle multiple parallel transcription requests? How many requests can it handle, and will there be any performance issues? It may be a basic question, but I want to know about it. Thanks :)


cjpais commented Mar 21, 2024

Yes, it can. On an RTX 4080 I am able to get 4 parallel streams without issue. 4 streams is the default value for max_clients in the TranscriptionServer class; you can specify more or fewer for your particular application.

With 4 parallel streams the performance impact looks minimal to my eye, but I have not benchmarked it.

One note: each stream loads into VRAM, so eventually you will be limited by VRAM.
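For reference, a minimal sketch of starting the server with a custom client limit. Only the class name (TranscriptionServer) and the max_clients setting come from the comment above; the import path, the run() keyword arguments, and where max_clients is accepted are assumptions, so check the server source for the exact API.

```python
# Minimal sketch: start the transcription server with a higher client limit.
# The module path and the placement of max_clients are assumptions; only the
# TranscriptionServer class and the max_clients setting come from the
# discussion above.
from whisper_live.server import TranscriptionServer  # assumed module path

server = TranscriptionServer()
server.run(
    host="0.0.0.0",
    port=9090,
    max_clients=8,  # default is 4; raise or lower to fit available VRAM
)
```

Keep in mind the VRAM note above: each additional client loads into GPU memory, so the practical ceiling for max_clients is set by your card, not by the parameter itself.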
