long-form/streaming support? #53

Blazzycrafter · 2024-02-14T18:34:58Z

i wanna use it in role plays and the text is mostly 500+ chars long, so generation takes a long time...
is there a streaming mode planned?
like in xtts?

vatsalaggarwal · 2024-02-21T09:13:41Z

We're planning to release long-form and streaming soon after we've had some bandwidth to push code with faster inference...

by the way, can you point me to how you're generating 500+ chars / streaming with xtts? i've tried https://huggingface.co/spaces/coqui/xtts but this has a 200 chars limit...

platform-kit · 2024-03-12T20:37:57Z

Hey @vatsalaggarwal, is that release still in the pipeline?

sidroopdaska · 2024-03-14T13:27:59Z

@platform-kit, yes, the release is still planned. We just released fine-tuning capabilities (#93). We are now going to start working on long-form & streaming.

Would love insights on the below

> by the way, can you point me to how you're generating 500+ chars / streaming with xtts? i've tried https://huggingface.co/spaces/coqui/xtts but this has a 200 chars limit...

platform-kit · 2024-03-14T18:11:12Z

@sidroopdaska The way I did this in my implementation of XTTS (https://github.com/Render-AI/cog-xtts-v2/blob/main/predict.py) was to split the text into chunks (e.g. sentences, but it could be done in other ways), render each chunk as its own audio output, and then concatenate the audio.

You do lose some context this way but it makes the output very stable (avoiding weird outputs where the voice trails off as the duration increases, for example).
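For illustration, here is a minimal sketch of that chunk-and-concatenate idea. It is not the code from predict.py: `synthesize` is a hypothetical stand-in for whatever TTS call you use (assumed here to return a list of float samples), and `split_sentences`, `max_chars`, `sample_rate`, and `pause_s` are names and defaults made up for the example.

```python
import re
from typing import Callable, Iterator, List


def split_sentences(text: str, max_chars: int = 200) -> List[str]:
    """Split on sentence-ending punctuation, then wrap anything still too long."""
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    chunks: List[str] = []
    for s in sentences:
        # Break over-long sentences at the last space before max_chars.
        while len(s) > max_chars:
            cut = s.rfind(" ", 0, max_chars)
            if cut <= 0:  # no usable space: hard cut
                cut = max_chars
            chunks.append(s[:cut].strip())
            s = s[cut:].strip()
        if s:
            chunks.append(s)
    return chunks


def stream_long(text: str, synthesize: Callable[[str], List[float]]) -> Iterator[List[float]]:
    """Yield audio chunk by chunk, so playback can start before the full text is rendered."""
    for chunk in split_sentences(text):
        yield synthesize(chunk)


def synthesize_long(
    text: str,
    synthesize: Callable[[str], List[float]],  # hypothetical TTS call
    sample_rate: int = 24000,                  # assumed output rate
    pause_s: float = 0.2,                      # assumed gap between chunks
) -> List[float]:
    """Render each chunk separately and concatenate, with a short silence in between."""
    silence = [0.0] * int(sample_rate * pause_s)
    audio: List[float] = []
    for i, samples in enumerate(stream_long(text, synthesize)):
        if i > 0:
            audio.extend(silence)
        audio.extend(samples)
    return audio
```

`stream_long` is only pseudo-streaming: each sentence is still generated in full before it is yielded, so the latency you save is per-chunk rather than per-token.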

MethanJess · 2024-03-25T06:44:07Z

> Would love insights on the below
>
> > by the way, can you point me to how you're generating 500+ chars / streaming with xtts? i've tried https://huggingface.co/spaces/coqui/xtts but this has a 200 chars limit...

@sidroopdaska
daswer123 has made a WebUI that accepts text input of unlimited length; API streaming is still coming soon, though. https://github.com/daswer123/xtts-webui

vatsalaggarwal changed the title ~~steam support?~~ streaming support? Feb 29, 2024

vatsalaggarwal changed the title ~~streaming support?~~ long-form/streaming support? Feb 29, 2024

vatsalaggarwal added the "feature request" label (New feature or request) Mar 12, 2024

MethanJess mentioned this issue Apr 1, 2024

Support for long-form synthesis. #120

Open
