Howto mute the microphone when noise in room (false wake up detection)

alainvdu69 · December 20, 2021, 3:52pm

Hi !

I’m using Rhasspy addon on HASS.IO.

I want to mute the microphone used by Rhasspy when there is noise in the room (tv, music, …) because i want to avoid the numerous false wake up.
The mute/unmute will be done with a Magic Aqara Cube.

Thx.

romkabouter · December 20, 2021, 8:32pm

Send a message to hermes/hotword/toggleOff will disable hotword detection

Check here for the reference:
https://rhasspy.readthedocs.io/en/latest/reference/#hotword-detection

alainvdu69 · December 21, 2021, 5:24pm

Good idea, thanks a lot.

farrad · March 6, 2022, 7:15pm

I tried this one and it does not work as expected.
Despite successfully writing to the hermes/hotword/toggleOff wakeword detection remains enabled.

Rhasspy 2.5.11
Wakeword Engine: Porcupine

romkabouter · March 6, 2022, 7:22pm

what is your payload?

farrad · March 8, 2022, 9:25pm

This is the payload I send:

{
     "siteId": "default",
     "reason": "Voice Command"
}

romkabouter · March 9, 2022, 8:12am

And how is your setup? Is your Rhasspy setup actually called default or did you give your Rhasspy a different siteId?

donburch · March 12, 2022, 8:06am

Hmmm, my partner likes the TV fairly loud while she’s working in the kitchen; and then when watching TV or movie I find I’m constantly raising the volume to hear the dialog then rushing to lower it when the background music or effects comes on. Death is too good for the people who mix the sound for movies

Anyway, this results in a lot of false hotword detections. I have tried adjusting porcupine’s sensitivity, but I believe there’s simply too much noise for porcupine to cope well with. Soooo… trying to think laterally

I previously invested in a Yamaha soundbar with “clearvoice” technology to make dialog clearer … but not much noticable effect.
following this thread I could automatically turn off hotword detection when the TV is turned on, and turn it back on when the TV is turned off. We don’t give many commands while TV is on, but it will be weird having to turn the TV off if we do want to give a command. Possibly I can also detect if a movie is paused ? - but that will probably depend on which AndroidTV app is running.
I haven’t had any luck so far setting or reading the volume level through media_player or androidtv integrations.
wondering about setting up a physical button to pause and/or mute whatever is playing, and turn hotword detection on so we can speak a command.

But first I can ask you all for other options which I have not thought of

Nailik · March 15, 2022, 7:55pm

Depending on your system you use to manage all your devices you can actually get the play status of the TV (playing or paused).

I’m using HomeAssistant with the Android TV addon and NodeRed and a helper input boolean.

On the Bottom you have an interval of 1s to faster update the playing state if the tv is on (otherwise it would be 10s)

On the top it reacts to the playing state and toggles the input boolean and the input boolean then calls the http service to turn on/off the wakeword

donburch · March 18, 2022, 8:38am

Hi Nailik, I have a nVidia ShieldTV (which to HA is just a glorified AndroidTV), a dumb TV, and HA with node-RED … so I’m having a go at implementing your flow.

I was initially a little reluctant about this approach because each androidTV app can operate in different ways; and “pause” doesn’t make much sense when watching live TV or streaming radio. If the ShieldTV’s remote had a “mute” button it would make sense to check that too.

But this does have the advantage of using the shieldTV remote and being relatively easy to implement.

I’m having a little trouble figuring out your bottom sequence … I assume the inject is repeating every 1 second, and updating the state of the media device if the TV is on … but what does the “configurable interval” do ?

Also @romkabouter can you help ? I am trying to use MQTT to toggle rhasspy’s hotword detection, but I have obviously not got the payload correct, since it does not appear to stop listening for hotword. The satellite in my living room with the ShieldTV is “sat-1”. The most obvious nodes are:

When the ShieldTV is paused (or turned off in this case) I am getting in the debug window
Screenshot from 2022-03-18 19-35-35

Nailik · March 18, 2022, 10:21am

but what does the “configurable interval” do ?

I actually didn’t knwo that the inject node can repeat by itself and used it only to start up …

I am trying to use MQTT to toggle rhasspy’s hotword detection

I am using the http service to toggle wakeword detection /api/listen-for-wake and as payload i use the entity state of my input_boolean

You should not put the topic inside the payload but inside the mqtt out node i think

romkabouter · March 18, 2022, 2:29pm

I think you must add the topic in de Topic input box. Not in de payload message. I know it says you can leave it empty, but then you have the specify the topic in the msg object, not the msg payload.

So I suggest removing topic from the payload, and add it to the Topic input box.

donburch · March 19, 2022, 6:05am

@romkabouter, I had the same thought after posting my last message, so changed back to using 2 MQTT nodes which include the topic - but I didn’t have time to test last night.

After a bit of testing, today I have made a couple of simple test sequences:

The “Turn OFF hotword detection” node is:
Screenshot from 2022-03-19 16-35-37
and the MQTT out node is:

I am assuming that leaving the topic in the payload will be unexpected, and so ignored ?

After rebooting the “sat-3B” satellite in my study for testing I have checked that Porcupine is responding, and cleared MQTT Explorer.

Injecting the Turn OFF hotword gives the expected debug result in node-RED, and in MQTT Explorer

Porcupine stopped listening ! That is great !

BUT … when I inject the Turn ON hotword … Porcupine continues to ignore me !

Try again, and even remove the extra topic from the payload … but still no beep when speaking “porcupine”.

Please, can you help identify what I have missed.

donburch · March 19, 2022, 7:16am

Still fiddling with this. I went into the Rhasspy settings for Wakeword, and adjusted the sensitivity vale. On saving this setting, porcupine started listening again.

romkabouter · March 19, 2022, 11:59am

I do not know, you mention toggleOff working so I think you assume correct
Can’t see an error in the toggleOn, maybe you can find what the regular flow publishes to the toggleOn topic. It should be the same, but might be different for some reason?

donburch · March 20, 2022, 3:43am

Or maybe rhasspy on sat-3 has simply stopped listening to MQTT ?

Leading me to the Rhasspy log - no, not the empty “log” in the web user interface - but to journalctl -e -u rhasspy.service … which includes:

ar 19 21:10:10 rhasspy-sat-2 rhasspy[609]: [ERROR:2022-03-19 21:10:10,833] rhasspywake_porcupine_hermes: parse_mqtt_message (topic=hermes/hotword/toggleOn)
Mar 19 21:10:10 rhasspy-sat-2 rhasspy[609]: ValueError: 'TV not playing' is not a valid HotwordToggleReason
Mar 19 21:10:20 rhasspy-sat-2 rhasspy[609]: ValueError: 'TV _is_ playing' is not a valid HotwordToggleReason

So, apparently the “reason” must be a pre-defined value. Another look at https://rhasspy.readthedocs.io/en/latest/reference/#hotword-detection and try with reason: “” … and it works - though on my RasPi 3B it takes 25 seconds for the hotword detection to start working.

Why so long to start working ? Back to the log. This time, as soon as I inject the Hotword ON I get 500 lines of python errors. Maybe something else got corrupted. Try the Microsoft fix: reboot sat-3B satellite and try again…

Turn OFF Hotword works correctly with MQTT explorer showing only the toggleOFF. Check the service log.

Turn ON Hotword detection, and it shows in WQTT Explorer. Log now shows:

Mar 20 12:54:33 rhasspy-sat-2 rhasspy[613]: [DEBUG:2022-03-20 12:54:33,476] rhasspyserver_hermes: Starting web server at http://0.0.0.0:12101
Mar 20 12:54:33 rhasspy-sat-2 rhasspy[613]: Running on 0.0.0.0:12101 over http (CTRL + C to quit)
Mar 20 13:05:39 rhasspy-sat-2 rhasspy[613]: [DEBUG:2022-03-20 13:05:39,487] rhasspywake_porcupine_hermes: <- HotwordToggleOff(site_id='sat-3B', reason=<HotwordToggleReason.UNKNOWN: ''>)
Mar 20 13:05:39 rhasspy-sat-2 rhasspy[613]: [DEBUG:2022-03-20 13:05:39,488] rhasspywake_porcupine_hermes: Disabled
Mar 20 13:08:12 rhasspy-sat-2 rhasspy[613]: [DEBUG:2022-03-20 13:08:12,961] rhasspywake_porcupine_hermes: <- HotwordToggleOn(site_id='sat-3B', reason=<HotwordToggleReason.UNKNOWN: ''>)
Mar 20 13:08:12 rhasspy-sat-2 rhasspy[613]: [DEBUG:2022-03-20 13:08:12,962] rhasspywake_porcupine_hermes: Enabled
Mar 20 13:08:13 rhasspy-sat-2 rhasspy[613]: [DEBUG:2022-03-20 13:08:13,009] rhasspywake_porcupine_hermes: Receiving audio sat-3B

Give Porcupine a command, and it works. Check the log and no errors ! So far, so good … but still takes up to 25 seconds after processing the hotword/toggleON before it starts to respond to the wakeword.

romkabouter · March 20, 2022, 8:52am

That is really long indeed. Can’t say why that happens but nice to see that it works. Maybe pass a know reason speeds it up
I did not know you need to pass a valid reason, that is not well documented

romkabouter · March 20, 2022, 8:56am

These are the valid reasons:

class HotwordToggleReason(str, Enum):
“”“Reason for hotword toggle on/off.”""

UNKNOWN = ""
"""Overrides all other reasons."""
DIALOGUE_SESSION = "dialogueSession"
"""Dialogue session is active."""
PLAY_AUDIO = "playAudio"
"""Audio is currently playing."""
TTS_SAY = "ttsSay"
"""Text to speech system is currently speaking."""

See here:

github.com

rhasspy/rhasspy-hermes/blob/badd69285a775f4f3d1266d83de58214aca1c08f/rhasspyhermes/wake.py

"""Messages for wake word detection."""
import re
import typing
from dataclasses import dataclass
from enum import Enum

from dataclasses_json import LetterCase, dataclass_json

from .base import Message


class HotwordToggleReason(str, Enum):
    """Reason for hotword toggle on/off."""

    UNKNOWN = ""
    """Overrides all other reasons."""
    DIALOGUE_SESSION = "dialogueSession"
    """Dialogue session is active."""
    PLAY_AUDIO = "playAudio"
    """Audio is currently playing."""

This file has been truncated. show original

donburch · March 20, 2022, 9:14am

OK, to recap for any other newbie trying to do this …

I have setup a couple of node-RED sequences to stop Rhasspy from listening for its wakeword in the living room while the TV is playing. This is to remove all those false positives where Porcupine thought it heard it’s name, and then listened to the TV for an intent.

How it works
Whenever the state of the media_player.shield_tv changes, it checks the current state, and saves it in msg.payload.

The switch checks for msg.payload being “playing”; or “paused”, “standby” or any other value. In either case, we setup the msg.payload ready to send by MQTT. Note that “sat-1” is the name of the satellite in my living room, and reason apparently must be an empty string.

Finally use a MQTT Out node with topic of toggleON or toggleOFF as appropriate.
Screenshot from 2022-03-19 16-37-00

The other flow makes the above flow more responsive, by updating the shield_tv’s state every 1 second.
It starts with an inject, which repeats at 1 second interval. If the state of my shield_tv is not “off”, it calls home assistant’s update_entity service to update the media_player.shield_tv

Considerations:

The real issue here is that Rhasspy with a reSpeaker HAT doesn’t do a great job filtering out background noise. The reSpeaker’s multi-mic hardware is good - but needs better device driver or application software to add the desired features. There are microphones with AI features built into their firmware - but at too high a price for me.
Alternatively the google and alexa devices do a great job, but you loose local control.
media_player state of “paused” works for Netflix and many streaming media, but doesn’t make much sense when watching live TV/radio or some of the other AndroidTV apps. If my ShieldTV’s remote had a “mute” button, I would check for that too.
I am still finding that Rhasspy takes about 20 seconds after processing toggleON before it responds to its wakeword.
In the longer run I am thinking of a big red “pause” button to sit on the living room coffee table. Pressing it would get node-RED/Home Assistant to pause or mute the currently running AndroidTV app and enable hotword detection. Releasing (or pressing again) would disable rhasspy hotword detection and un-mute or play the AndroidtTV app.

rolyan_trauts · March 20, 2022, 5:49pm

Only the respeaker USB higher end cards have any form of NS the rest are just multi mic sound cards.
Even then NS (noise supression) works with relatively low level static noise such as fans or heaters and things that are more of a constant hum.
Get anything moving and jumping around such as TV or HiFi then your talking RTXvoice levels of NS or what is often used targeted speaker separation such as VoiceFilterLite which takes a different take to filtering unknown noise and doesn’t bother and extracts known voice.
Still no software like that avail that is currently running on a pi even though the latter could be possible.
Also because Rhasppy is NS agnostic the atrifacts some of the NS algs make can be a problem for recognition than say a system trained for a specific NS.