Where to start?

DanielW · June 17, 2020, 7:16pm

I am also using Kaldi and PicoTTS (it’s OK, but compared to Google or Amazon it is still really bad).

Wake word is an issue. I am currently using Porcupine with one of the universal models. Problem: I don’t really like the wake word choices. They all feel weird to say for a native German speaker and are not easy to pronounce without a German accent. I am currently using “pico voice” as the wake word. It recognizes me most of the time, with very few false activations.

@moqart
I am thinking about making my own custom wake word with Precise. How did you do it (and what was the source of the training data)?

HorizonKane · June 17, 2020, 7:17pm

Is it possible to use Rhasspy with Amazon Polly, like it was with Snips?

koan · June 17, 2020, 7:38pm

Have you tried MaryTTS? I don’t know the quality of the German voices, but I’m using the voice dfki-prudence-hsmm for English and it’s really enjoyable.

moqart · June 17, 2020, 7:40pm

@DanielW If you want more information on training a custom wake word model with Mycroft Precise, take a look at this: Mycroft Precise Model.
I followed the instructions from Mycroft for training your own wake word model.

moqart · June 17, 2020, 7:43pm

What kind of hardware are you running your system on?
How fast does MaryTTS work on a Pi 4?

JGKK · June 17, 2020, 7:45pm

I wrote down some of my experiences with training a custom wake word in this post: Mycroft Precise Model Problem (computer-en.pb), especially with regard to training a noise-resistant model.

koan · June 17, 2020, 7:56pm

This is on a Raspberry Pi 4. I haven’t benchmarked it, but it’s fast enough for me.

DanielW · June 17, 2020, 8:37pm

I just tried the three German voices, but for me there is no huge difference in quality compared to PicoTTS. It does use more RAM and CPU, though. (It is OK on a Pi 4, but longer texts will slow down Rhasspy.)

HorizonKane · June 17, 2020, 9:36pm

Got it up and running. I will build the functionality first and try out different voices later.

tobetobe · June 17, 2020, 10:41pm

Have a look at this:

https://rhasspy.github.io/rhasspy-voltron/tutorials.html#shared-mqtt-broker

It explains how to set up master/satellite with Rhasspy, as you are used to with Snips.

Of course, no problem. It really works well.

tobetobe · June 17, 2020, 10:45pm

What mics are you using? ReSpeaker or Matrix or …?

If ReSpeaker or Matrix, have a look at HLC LED Control in this forum. It lights up your LEDs, triggered by events.

HorizonKane · June 18, 2020, 5:34am

I use the PS Eye cam on the big Raspi, which is working great. Later I want to try my Zeros with a two-mic board attached (I may check for the name later), which worked great with Snips before. I think those satellites have an LED I might start using.

I found the ReSpeakers just too expensive when creating multiple devices. Most are even placed where they are invisible.

HorizonKane · June 18, 2020, 5:37am

I think my other mics are Seeed HATs, if I remember correctly.

bwong · August 5, 2020, 9:41pm

Hi @koan, how did you get your setup to work with MaryTTS?

The playback is sped up, as if it were fast-forwarded.

Could you give some pointers?

FredTheFrog · August 5, 2020, 11:47pm

Hi @koan!! Could you please provide instructions for building/installing that MaryTTS voice dfki-prudence-hsmm? I took a very quick look at the MaryTTS GitHub, and am not certain where to get started. Will I need to install a full MaryTTS environment on my Raspbian OS? Thank you in advance for EVERYTHING you do for this Rhasspy community.

No_one · August 6, 2020, 12:06am

@FredTheFrog are you using Docker? synesthesiam released a Docker image for MaryTTS here: https://rhasspy.readthedocs.io/en/latest/text-to-speech/#marytts

FredTheFrog · August 6, 2020, 12:09am

Thank you!! Most appreciated.

koan · August 6, 2020, 6:16am

I didn’t do anything special: I swapped out eSpeak for MaryTTS and it just worked. This doesn’t seem to be a MaryTTS issue, but an issue with the settings for your audio device.

koan · August 6, 2020, 6:18am

I use the Docker image, works fine here.

bwong · August 6, 2020, 10:00am

Hmm, if my audio settings were the issue, I would think that eSpeak should also exhibit the same issue, but it does not. eSpeak is the only TTS that works (meaning I can hear the words, and not something that sounds like it’s fast-forwarded). Can anyone point me to some docs to help me start diagnosing this issue? I’m not really sure what keywords to search for.
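
One general cause of fast-forwarded playback is a sample-rate mismatch: the audio device plays a WAV file at a higher rate than the file declares. That could also explain why only one TTS sounds right, if the engines produce different sample rates, although nothing in this thread confirms that this is what is happening here. A minimal sketch for checking it, assuming the output of each TTS can be saved as a WAV file; the function names and the example rates are illustrative and not part of Rhasspy or MaryTTS:

```python
import os
import tempfile
import wave


def describe_wav(path):
    """Return the WAV header fields that determine playback speed."""
    with wave.open(path, "rb") as wav:
        frames = wav.getnframes()
        rate = wav.getframerate()
        return {
            "channels": wav.getnchannels(),
            "sample_rate_hz": rate,
            "sample_width_bytes": wav.getsampwidth(),
            "duration_s": frames / rate,
        }


def speedup_factor(file_rate_hz, playback_rate_hz):
    """How much faster audio sounds if it is played at playback_rate_hz
    instead of the rate stored in the file."""
    return playback_rate_hz / file_rate_hz


def write_silence(path, rate_hz, seconds=1.0):
    """Write a mono 16-bit silent WAV; only used to exercise describe_wav."""
    with wave.open(path, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)
        wav.setframerate(rate_hz)
        wav.writeframes(b"\x00\x00" * int(rate_hz * seconds))


if __name__ == "__main__":
    # Hypothetical example: a 16 kHz file played on a device fixed at 48 kHz.
    with tempfile.TemporaryDirectory() as tmp:
        sample = os.path.join(tmp, "sample.wav")
        write_silence(sample, 16000)
        info = describe_wav(sample)
        print(info)
        print("speed-up:", speedup_factor(info["sample_rate_hz"], 48000))
```

Running `describe_wav` on a file from the working TTS and on one from the fast-sounding TTS shows whether their sample rates or channel counts differ. If they do, the playback device or command is the next place to look.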