Does anyone have experience on using custom wakeword with snowboy ?
I have a lot of false positive with words that aren’t near the wakeword. I can avoid that by decreasing sensitivity (ever put audio_gain to 2) but then the detection isn’t easy. Compared to snips custom wakeword, it’s really worse.
I have created the custom wakeword on the snowboy website, with a Surface Pro microphone.
Also, was in the same room where the rpi is. Seems it is better to record it in another room so the background noise isn’t the same and avoid false/positive, will try to record it again.
Anyway, do you have advices and/or feedback on this ? Didn’t find a way to record it on the pi with the respeaker, may be better than surface mic.
I usually just use arecord to record something on the Pi. Use arecord -L to a a list of available devices. A command like this will record it like Rhasspy does:
sensitivity 0.38 still detected lot of false positives. Even asr recognize “le sapin en haut” with nothing talking, just playing some piano.
sensitivity 0.35 is better for false positive but really hard to detect hotword …
Will try to redo custom wakeword and try some different settings, but never had so much problems with snips custom wakewords
@koan Do you know how to use something like Tract to train a wake word detection system?
It seems that to get a good recognition rate for a universal hotword, you need lots of records in the dataset augmented with lots of additional noise variations, etc. Exactly like for an acoustic model but for a very specific word.
For custom wake word it can be challenging using just a few records.
If snowboy and porcupine (which are not really open source) do not cut it, how can we do it?
I have switch microphone from pyaudio to arecord in settings and it seems a lot better !
I can up sensitivity without more false positive
Also, something I got from discussing custom wakeword with snips guys: We should not record custom wakeword samples in same room as where the device will be. Each room have its own background noise, and having this same noise into wakeword can generate more false positives.
So, we should record custom wakeword samples on same device/mic that will detect wakeword, but in a different room, ideally the most silent one where you will never use master or a satellite. So you can reuse custom wakewords on every rhasspy devices with less problems.
No, unfortunately I’m not that deep (anymore) in language technology, and my interest has always been more in symbolic systems than in neural networks. But that won’t help us here…
Well, I don´t think that snips is willing to help, but maybe Sonos?
No, just kidding!
But this user was a maker for Snips and I just know that he also was very upset about the surprised sale to Sonos.
Maybe he is interested to support?!
I don´t know.
What about Mycroft precise - it’s a bit messy to set up, but we should be able to do our own training. Maybe we can even agree on some wakewords as a community and generate enough “noise” to train them?