Rhasspy 2.5.8 Released

Hi everyone :wave:

With the holidays coming up, it seems like a good time to push out a new release. Unlike 2.5.7, there are quite a few new things in 2.5.8 to go over.

Thanks to everyone who contributed, and to the many community members who are helping us build a great voice assistant for everyone :rainbow: . As always, please open GitHub issues so we can squash those bugs :bug:

Larynx TTS

This release finally incorporates the Larynx text to speech system, which is a fork of MozillaTTS. The goal of this TTS system is to provide high quality voices for as many languages as possible, replacing the need for Google Wavenet.

Once it gets warmed up, Larynx runs well on x86_64 systems (NUC, etc.), and OK on a Pi 4. I wouldn’t recommend trying to use it on a Pi 3 or 2. It uses PyTorch on the CPU, so there may be room for improvement with a GPU someday in the future.

Out of the box, I have voices for Dutch, German, French, Spanish, and Russian. Many more are currently in progress, including English, Swedish, Portuguese, and Vietnamese :slight_smile:

New Kaldi STT Models

In line with the Master Plan, I’ve trained up Kaldi speech to text models for Italian, Spanish, French, and Russian. You can use these now in Rhasspy by selecting Kaldi in the appropriate profile.

More languages are coming as I locate public speech data. There are also several efforts underway to crowd-source this data from the Rhasspy community and other places. If you know of a good dataset or would like to volunteer, please let me know!

Volume Everywhere

Many users have asked for the ability to adjust Rhasspy’s output volume, so I’ve made an effort to add this in a way that (I think) makes the most sense.

In the Settings page, you can now independently set the volumes of:

  • The audio output service (aplay)
  • The text to speech service
  • The dialogue feedback sounds (beeps)

On the main web UI page, there is also a handy “Set Volume” button. If you leave the site ID text box next to it blank, it will change the volume on whatever system you’re using. But you can also put specific site IDs in the box and change the volumes of multiple satellites at once (this uses a new MQTT message).

Lastly, there’s a new /api/set-volume HTTP endpoint where you can programmatically set the volume. It also takes a ?siteId=site1,site2,.. parameter if you want to set the volume for multiple site IDs. Oh, and /api/text-to-speech now has a ?volume=0.5 parameter if you want just one utterance to be quiet.
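
As a rough illustration of the two endpoints above, here is a minimal Python sketch that only builds the requests. It assumes Rhasspy is listening on localhost:12101 and that /api/set-volume expects the volume (0-1) as the POST body; the body format and the helper names (set_volume_request, say_request) are assumptions for this example, not taken from the release notes.

```python
from typing import Iterable, Optional
from urllib.parse import urlencode
from urllib.request import Request

# Assumption: the Rhasspy web server is reachable at this address.
BASE_URL = "http://localhost:12101"


def set_volume_request(volume: float, site_ids: Iterable[str] = ()) -> Request:
    """Build a POST to /api/set-volume (volume sent as the body, an assumption)."""
    if not 0.0 <= volume <= 1.0:
        raise ValueError("volume must be between 0 and 1")
    url = f"{BASE_URL}/api/set-volume"
    site_ids = list(site_ids)
    if site_ids:
        # Multiple satellites are passed as one comma-separated siteId value.
        url += "?" + urlencode({"siteId": ",".join(site_ids)}, safe=",")
    return Request(url, data=str(volume).encode("utf-8"), method="POST")


def say_request(text: str, volume: Optional[float] = None) -> Request:
    """Build a POST to /api/text-to-speech, optionally quieter for one utterance."""
    url = f"{BASE_URL}/api/text-to-speech"
    if volume is not None:
        url += "?" + urlencode({"volume": volume})
    return Request(url, data=text.encode("utf-8"), method="POST")


# To actually send a request against a running server:
#   from urllib.request import urlopen
#   urlopen(set_volume_request(0.5, ["site1", "site2"]))
```

Building the Request objects separately from sending them makes it easy to inspect the URL before anything reaches a satellite.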

Complete Changelog

Added

  • Russian Kaldi profile and Larynx TTS voice
  • Spanish Kaldi profile and Larynx TTS voice
  • French Kaldi profile and Larynx TTS voice
  • Italian Kaldi profile
  • German Larynx TTS voice
  • Volume scale (0-1) for feedback sounds and TTS
  • rhasspy/audioServer/setVolume MQTT message and /api/set-volume HTTP endpoint
  • rhasspy/asr/recordingFinished MQTT message sent immediately after silence detection
  • Satellite site ids to intent handling settings in web UI
  • Group separator for co-located satellites (dialogue.group_separator)
  • num2words support for Swedish (thanks Bostrom!)

Fixed

  • Argument list for sound output command system (jrouly)
  • Expand environment variables in TLS ca_certs
  • spn silence phone in Swedish profile
  • Use callback API in PyAudio to avoid buffer overrun
  • HTTP API JSON should not be forced to ASCII

Changed

  • Default Kaldi language model type is now text FST instead of arpa

Great work! Hope to try it soon :slight_smile:

Loaded it up on my server and satellites and no issues so far. Awesome work! Thank you.

I did however notice one of the new features to set the volume of the “beeps” doesn’t seem to be working.
Setting the aplay volume on the Satellite seems to affect the beeps and tts, but changing the volume of the “Sounds” on the Satellite (even down to .1) doesn’t seem to make an audible difference.

Speaking of the beeps, is there a way to simply disable some or all? And if the Wake WAV is disabled, will the delay be shorter before it begins listening for the command?

Hmmmm, I’ll take a look. Thanks for the feedback.

If you delete the file name in the web UI, it should stop playing that WAV file. There should be a shorter delay too, since there’s no worry of the mic picking up the beeps as speech.

Thank you very much for the new version!

I tried Larynx TTS (de-thorsten) on my server (Synology Intel NAS) with a satellite setup, but I always get a timeout error:

[ERROR:2020-11-20 20:49:28,093] rhasspyserver_hermes: 
Traceback (most recent call last):
  File "/usr/lib/rhasspy/.venv/lib/python3.7/site-packages/quart/app.py", line 1821, in full_dispatch_request
    result = await self.dispatch_request(request_context)
  File "/usr/lib/rhasspy/.venv/lib/python3.7/site-packages/quart/app.py", line 1869, in dispatch_request
    return await handler(**request_.view_args)
  File "/usr/lib/rhasspy/rhasspy-server-hermes/rhasspyserver_hermes/__main__.py", line 1282, in api_train
    result = await core.train()
  File "/usr/lib/rhasspy/rhasspy-server-hermes/rhasspyserver_hermes/__init__.py", line 461, in train
    timeout_seconds=self.training_timeout_seconds,
  File "/usr/lib/rhasspy/rhasspy-server-hermes/rhasspyserver_hermes/__init__.py", line 971, in publish_wait
    result_awaitable, timeout=timeout_seconds
  File "/usr/lib/python3.7/asyncio/tasks.py", line 449, in wait_for
    raise futures.TimeoutError()
concurrent.futures._base.TimeoutError

How can I find out more about what is not working? Is more debug output available somewhere?

Thank you!

You’re welcome! Do you see any messages from rhasspytts_larynx_hermes in the log? It can take some time for MozillaTTS to load the model; you should see a message that it successfully created a synthesizer.

Unfortunately I don’t see such a log.
Only:

[DEBUG:2020-11-20 23:00:00,771] rhasspyprofile.download: Skipping tts/larynx/de/thorsten/vocoder/config.json (/profiles/de/tts/larynx/de/thorsten/vocoder/config.json)
[DEBUG:2020-11-20 23:00:00,770] rhasspyprofile.download: Skipping tts/larynx/de/thorsten/vocoder/checkpoint_500000.pth.tar (/profiles/de/tts/larynx/de/thorsten/vocoder/checkpoint_500000.pth.tar)
[DEBUG:2020-11-20 23:00:00,768] rhasspyprofile.download: Skipping tts/larynx/de/thorsten/scale_stats.npy (/profiles/de/tts/larynx/de/thorsten/scale_stats.npy)
[DEBUG:2020-11-20 23:00:00,767] rhasspyprofile.download: Skipping tts/larynx/de/thorsten/config.json (/profiles/de/tts/larynx/de/thorsten/config.json)
[DEBUG:2020-11-20 23:00:00,766] rhasspyprofile.download: Skipping tts/larynx/de/thorsten/checkpoint_380000.pth.tar (/profiles/de/tts/larynx/de/thorsten/checkpoint_380000.pth.tar)
[DEBUG:2020-11-20 23:00:00,764] rhasspyprofile.download: text_to_speech.system larynx larynx = True

OK, do you see files in your profile under the tts/larynx directory?

Yes:

/de/tts/larynx$ ls -Ra
.:
. .. cache de

./cache:
. ..

./de:
. .. thorsten

./de/thorsten:
. .. checkpoint_380000.pth.tar config.json scale_stats.npy vocoder

./de/thorsten/vocoder:
. .. checkpoint_500000.pth.tar config.json

@synesthesiam I have my Rhasspy server running as a Home Assistant Add-On.

There is currently no option to update this from 2.5.7.2 in HA.

Does it typically take a while to filter through to HA?

It is already available, but it has been renamed to Rhasspy Assistant, and the 2.4 version has been removed.
You can safely install that and remove Rhasspy 2.5, but first make a copy of the configuration.

Do not worry, your profiles folder on the share will not be deleted.

After uninstalling Rhasspy Assistant 2.5 and installing the new Rhasspy Assistant (pointing to 2.5.8), the 2.5 entry will be gone when you reload the add-ons :slight_smile:

@synesthesiam Congratulations! Thank you for your work!

Something seems to be broken when downloading the Kaldi base_dictionary.txt:
Can’t download https://raw.githubusercontent.com/rhasspy/ru_kaldi-rhasspy/raw/master/base_dictionary.txt.gz - 404: Not Found

You’re welcome :slight_smile:

Ah, I see what happened here. I’ll get a fix pushed out for this soon.

@Thargor, silly question: have you tried completely restarting Rhasspy? For some reason, the voice didn’t work for me until I did this.

But I do have this in my console log:

[DEBUG] {'de-thorsten': {'model_path': PosixPath('/home/hansenm/.config/rhasspy/profiles/de/tts/larynx/de/thorsten/checkpoint_380000.pth.tar'), 'config_path': PosixPath('/home/hansenm/.config/rhasspy/profiles/de/tts/larynx/de/thorsten/config.json'), 'vocoder_path': PosixPath('/home/hansenm/.config/rhasspy/profiles/de/tts/larynx/de/thorsten/vocoder/checkpoint_500000.pth.tar'), 'vocoder_config_path': PosixPath('/home/hansenm/.config/rhasspy/profiles/de/tts/larynx/de/thorsten/vocoder/config.json')}}
[DEBUG] Creating Larynx synthesizer (de-thorsten)...
[INFO] Created synthesizer for de-thorsten

Rhasspy runs on a Synology NAS inside Docker. You are right, the Docker log shows:

If I try to say “Das ist ein Test” from the satellite web frontend, I get the following log:

It seems there is a problem with an “illegal instruction”, and the process crashed…
Any idea what the reason could be?

OK, I’m guessing that the CPU in your Synology does not support AVX instructions. I’m using the official PyTorch CPU wheel for x86_64, which is probably compiled for AVX.

I may try and compile my own non-AVX wheel. It sucks that the moment you step into PyTorch/Tensorflow land, it suddenly really matters which year your CPU was made or what tier it is.
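
For anyone who wants to check their own machine before trying Larynx, here is a minimal Python sketch. It assumes a Linux x86 host where /proc/cpuinfo contains a "flags" line; the function names are made up for this example, and other platforms (e.g. ARM, which lists "Features" instead) would need a different check.

```python
from pathlib import Path
from typing import Set


def cpu_flags(cpuinfo_text: str) -> Set[str]:
    """Collect CPU feature flags from /proc/cpuinfo-style text."""
    flags: Set[str] = set()
    for line in cpuinfo_text.splitlines():
        key, _, value = line.partition(":")
        if key.strip() == "flags":
            flags.update(value.split())
    return flags


def supports_avx(cpuinfo_text: str) -> bool:
    """True if the 'avx' flag is listed for at least one core."""
    return "avx" in cpu_flags(cpuinfo_text)


def host_supports_avx(cpuinfo_path: str = "/proc/cpuinfo") -> bool:
    """Run the check against the local machine (Linux only)."""
    return supports_avx(Path(cpuinfo_path).read_text())
```

If host_supports_avx() returns False, an "illegal instruction" crash like the one above is the likely result with an AVX-only PyTorch build.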

Yes, you are right, it is a Celeron J3455 without AVX. At least this mystery is solved.

Many thanks for that!

I’ve integrated it into the Jeedom plugin, and it works perfectly!
We now have to set the device volume to 100% when the Raspberry Pi starts :smile:
amixer -c 0 set Playback 100%

Hi

First post here so apologies if I’ve messed something up.
I’ve just updated the deb package from 2.5.7 to 2.5.8. When I run “rhasspy -p en” I now get this error…

Starting up…
Using virtual environment at /usr/lib/rhasspy/.venv
python3: error while loading shared libraries: libpython3.7m.so.1.0: cannot open shared object file: No such file or directory

On the latest Ubuntu (20.10), Python is 3.8. I’ve tried symlinking the 3.8 lib to that file, but then I just get more errors…
symbol lookup error: python3: undefined symbol: _Py_UnixMain

Is this something anyone else has come across or has a workaround for ?

Hi @TotalSpaceshipguy, thanks for posting :slight_smile:

Can you try apt-get installing libpython3.7 and see if that fixes it? I tried a different method for packaging the .deb files this time around, and I may have missed a requirement.
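
To see whether the library is visible to the dynamic linker before and after installing the package, a small Python check like the following may help. This is only a sketch: find_libpython is a name invented for this example, and it relies on ctypes.util.find_library, which returns None when nothing matching is found.

```python
import ctypes.util
from typing import Optional


def find_libpython(version: str = "3.7m") -> Optional[str]:
    """Return the library name the linker would resolve for libpython<version>, or None."""
    return ctypes.util.find_library(f"python{version}")


# Example: print(find_libpython())  # None means libpython3.7m was not found
```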