This is not really related to Rhasspy - but I am interested in using this text-to-speech engine, as the quality is very good. My problem is related to the rendering speed, for example, this phrase:
curl -G --output - --data-urlencode 'text=Welcome to the world of speech synthesis!' 'http://192.168.0.61:5002/api/tts' | aplay
Takes about 10 seconds to render before playback on the synesthesiam mozilla-tts container on an AMD FX-6300 6core processor with 16GB ram.
I know its quite old hardware, but has anyone else used this tts service and obtained near instant (1-2 second delay or less) results on newer generation processors, or if there are any tweaks I can do to speed up the rendering on my hardware?