After a few quiet months, Rhasspy 2.5.6 has finally been released! This release includes contributions from many community members, credited below
Two of the biggest features in this release:
- Multi-site support for dialogue manager
- For MQTT base/satellite setups - allows each satellite to maintain an independent dialogue session. Previously, the newest session would cancel the previous one.
- A possible bug has already been found
- āText FSTā language model type for Kaldi
- For very large Rhasspy grammars, the default (ARPA) language models may behave poorly by adding extra words
- In āText FSTā mode, the ASR system can only produce sentences from your
sentences.ini
file (no extra words or different word order). This is faster to train and more accurate if you only care about using your exact voice commands.
Thanks to everyone for contributing, testing, answering questions, and helping to make Rhasspy better!
Lastly, as I mentioned in The Master Plan, weāll be looking for volunteers soon to donate their voices for new Rhasspy text to speech voices. We have volunteers for English, Dutch, and German currently. If you speak a different language, have a good microphone (like a Blue Yeti Nano), and are willing to license your recordings as public domain or Creative Commons, please let me know
Changelog
Added
- Multi-site support for dialogue manager
- Add āText FSTā language model type for Kaldi for strict grammar-based recognition
- UDP audio settings in web UI for Pocketsphinx wake word system
- Rudimentary SSML support in Google Wavenet TTS (digitalfiz)
Changed
- JSON output from all services is no longer forced to be ASCII
- fuzzywuzzy performance improvement by using sqlite database (maxbachmann)
- Lots of documentation improvements (koen)
- Strip commans from replaced numbers (āone thousand, one hundredā)
- Improve rhasspy-nlu performance (maxbachmann)
- Simplify Google Wavenet voice selection UI (Romkabouter)
- Fix local command when not using absolute path (DeadEnd)