About batch inference for multi-speakers #70

Closed · isjwdu opened this issue Apr 25, 2024 · 1 comment · Fixed by #74

Comments
isjwdu commented Apr 25, 2024

Hello, thank you for your great work.

I would like to ask two questions:

  1. Batch inference of different sentences from the same speaker.
    I am using --file to read a txt file containing multiple lines (4 lines, for example), and the following error is raised during inference:
File "/mnt/E/isjwdu/Matcha-TTS/matcha/models/components/text_encoder.py", line 403, in forward
     x = torch.cat([x, spks.unsqueeze(-1).repeat(1, 1, x.shape[-1])], dim=1)
RuntimeError: Sizes of tensors must match except in dimension 1. Expected size 4 but got size 1 for tensor number 1 in the list.

Is the original code set up to read only a single line at a time? Is there a recommended way to run inference on multiple sentences in a batch?
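
For reference, here is a minimal standalone repro of the mismatch (the tensor shapes are made up for illustration and are not taken from the actual model config):

```python
import torch

# x: encoder output for a batch of 4 sentences; spks: a single speaker
# embedding with batch size 1 (shapes chosen only for illustration).
x = torch.randn(4, 192, 50)   # (batch, channels, frames)
spks = torch.randn(1, 64)     # (1, spk_emb_dim)

# This mirrors line 403 of text_encoder.py and raises the same error,
# because the two tensors disagree in the batch dimension (4 vs 1):
#   torch.cat([x, spks.unsqueeze(-1).repeat(1, 1, x.shape[-1])], dim=1)

# Repeating the speaker embedding across the batch makes the shapes line up:
spks = spks.repeat(x.shape[0], 1)                                    # (4, 64)
out = torch.cat([x, spks.unsqueeze(-1).repeat(1, 1, x.shape[-1])], dim=1)
print(out.shape)              # torch.Size([4, 256, 50])
```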

  2. For batch inference from a single txt file containing different speakers and different sentences, do you have any suggestions or tips on how to modify the code?

For example, my txt file:

p329-016|p329|the norsemen considered the rainbow as a bridge over which the gods passed from earth to their home in the sky.
p316-091|p316|there was no bad behavior.

I want to synthesize a separate audio file for each line, using its corresponding speaker.

Looking forward to your reply.

@shivammehta25 (Owner) commented
Hello,
I have fixed multi-speaker batched synthesis.

For the 2nd part, I have not yet added any code to support batched inference with different speakers. However, it should be relatively simple: all you need to do is extract the texts and speaker ids from the input file and stack them. I don't plan to merge it into the codebase, but let me know if you need any help with it; I can add it to your fork or something.
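
A rough sketch of that idea, assuming a filelist in the utt_id|speaker_id|text format shown above; the filename, the character-level tokenizer, the "p329" → 329 speaker-id mapping, and the commented-out `model.synthesise` call are all placeholders to adapt to your setup, not code from this repo:

```python
import torch
from torch.nn.utils.rnn import pad_sequence

def load_filelist(path):
    """Parse lines of the form utt_id|speaker_id|text."""
    entries = []
    with open(path, encoding="utf-8") as f:
        for line in f:
            if not line.strip():
                continue
            utt_id, spk, text = line.strip().split("|", maxsplit=2)
            entries.append((utt_id, int(spk.lstrip("p")), text))  # "p329" -> 329; adapt to your id map
    return entries

entries = load_filelist("multi_speaker.txt")  # placeholder filename

# Dummy character-level tokenizer so the sketch runs standalone; swap in the
# real text-to-phoneme-id processing from your inference script.
def tokenize(text):
    return torch.tensor([ord(c) % 100 for c in text], dtype=torch.long)

tokens  = [tokenize(text) for _, _, text in entries]
lengths = torch.tensor([t.shape[0] for t in tokens])
batch   = pad_sequence(tokens, batch_first=True)        # (num_utts, max_len)
spks    = torch.tensor([spk for _, spk, _ in entries])  # (num_utts,), one id per row

print(batch.shape, lengths, spks)
# output = model.synthesise(batch, lengths, spks=spks, ...)  # placeholder call
# ...then write one wav per utt_id so each speaker gets its own file.
```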

Kind Regards,
Shivam
