Self Checks
Cloud or Self Hosted
Self Hosted (Source)
Environment Details
WIndows 11, Python 3.10 ,torch 2.4.1.+cu124, torchaudio 2.4.1.+cu124, torchvision 0.19.1++cu124, gradio 5.15.0, test at gradio web ui
Steps to Reproduce
- I loaded a German 12s audio file and added the correct transcription.
- I entered the following 2 texts for the synthesis: "Das ist ein schöner Text." and "Hier ist ein schöner Text.".
✔️ Expected Behavior
Complete synthesis of all words.
❌ Actual Behavior
The first one or two words were not or only partially reproduced. This seems to depend on the audio file and does not happen always. But, this file isn't corrupted. Per example with other tts libraries, there are no problems.
Self Checks
Cloud or Self Hosted
Self Hosted (Source)
Environment Details
WIndows 11, Python 3.10 ,torch 2.4.1.+cu124, torchaudio 2.4.1.+cu124, torchvision 0.19.1++cu124, gradio 5.15.0, test at gradio web ui
Steps to Reproduce
✔️ Expected Behavior
Complete synthesis of all words.
❌ Actual Behavior
The first one or two words were not or only partially reproduced. This seems to depend on the audio file and does not happen always. But, this file isn't corrupted. Per example with other tts libraries, there are no problems.