Hi everyone! This is a “preliminary” release of 2.5.10, meaning I’ve created a rhasspy/rhasspy:2.5.10
Docker tag and uploaded new Debian packages, but I haven’t made these the “latest” release yet (the docs are not updated yet either). Due to time constraints, I haven’t been able to run all my usual tests; but I wanted to get something out to everyone
There are some big additions in 2.5.10, such as:
- ASR support for Swedish (
sv
) - New Larynx that’s faster and has a ton of new voices
- Hint: switch to “Low Quality” on a Pi 4 or below for a big speed-up
- This version should also work with older x86_64 CPUs (no AVX)
- Kaldi ASR now has confidence value for words and sentences
- Dialogue manager now has a minimum ASR confidence threshold (
speech_to_text.<system>.min_confidence
where<system>
iskaldi
,deepspeech
, etc.)
I’d appreciate any testing and feedback that the community can offer! Thanks
Added
- New version of Larynx with improved performance and 35 voices (20 English, 1 German, 3 French, 2 Spanish, 3 Dutch, 2 Italian, 1 Swedish, 3 Russian)
- Kaldi ASR model for Swedish (sv)
- Confidence and word timings for Kaldi ASR
- Minimum ASR confidence threshold for dialogue manager
- Detect AVX support and warn for Larynx, DeepSpeech, and Precise in Web UI
- Handle spaces in converter arguments with word!(converter, …)
- rhasspy-tts-cli-hermes TTS commands may be Jinja2 templates (–use-jinja2)
- Support for MaryTTS effects (jasonhildebrand)
- customData added to hermes/nlu/query message
- customData is copied by NLU services from query to intent/intentNotRecognized
- lang property added for wake, speech_to_text, and intent profile sections
- Wake, ASR, NLU services all set lang properties if null
Fixed
- Remote HTTP service sets site_id of satellite for ASR/NLU endpoints
- DeepSpeech token output (was letters, now words)
- Multiple values in custom converters are sent as a list on stdin
- Don’t show restart/shutdown button if “sudo” isn’t available (Docker, Hass.io)
- Added missing espeak phonemes for some profiles
- MaryTTS voice test in Web UI
- Remove dialogue session from site cache on end
- Don’t throw error about system not configured if message is intent for satellite (schnopsi)
Changed
- /api/listen-for-command uses a proper wake workflow now (requires dialogue manager)
- Show absolute paths for custom models (precise, snowboy, porcupine) in Web UI