I want to build one voice assistant, and at a later stage several, primarily to control lights and other devices in Home Assistant. I would also like the option to use them to play back music or other media. I want to use them in German.
I am currently running Home Assistant on a Pi 3 (venv install), but I am planning to move it to an x64 server with Docker once I get the new MQTT-ZWave running on the Pi.
CPU- and memory-wise, it seems like a good idea to go the Rhasspy server + satellite route. On the other hand, if I need at least a Pi 3 for good wake word detection and low latency anyway, why would I not do the speech recognition on the device too?
Another question is which microphone to get. I found that really confusing.
At first I thought about getting a Matrix Voice ESP32 and using it as a standalone satellite. The advantage would be a small device with low power usage that I could easily put inside a case with a speaker.
BUT: It doesn’t seem to do any kind of audio processing such as echo cancellation or beamforming, because Matrix never finished those advertised features. It also cannot play back audio in good quality by itself over MQTT.
AEC (acoustic echo cancellation) in general seems really important if you want to play back audio and still be able to use wake word detection.
Another option is the ReSpeaker microphones. Do they all support hardware AEC and other audio processing? From what I understand, using the hardware AEC limits you to 16000 Hz playback. That doesn’t seem great for music. It seems like I either need to wait for a future device that supports more than that, or do AEC in software?
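To show why the 16000 Hz limit worries me, here is a rough sketch. It assumes the "16000 Hz" figure is the playback sample rate (that is my reading, not something I have confirmed), and it only applies the general rule that a sample rate can represent frequencies up to half its value. The function name is made up for illustration.

```python
def max_audio_frequency_hz(sample_rate_hz: int) -> float:
    """Highest frequency a given sample rate can represent (Nyquist limit)."""
    return sample_rate_hz / 2


# Assumption: 16000 Hz is the sample rate imposed when using hardware AEC.
# 44100 Hz is the usual CD audio sample rate, included for comparison.
for label, rate in [("AEC-limited playback", 16000), ("CD audio", 44100)]:
    limit = max_audio_frequency_hz(rate)
    print(f"{label}: {rate} Hz sample rate, content up to about {limit:.0f} Hz")
```

If that reading is right, everything above roughly 8 kHz would be missing from music played through such a device, which is fine for speech but noticeable for music.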
Which of the ReSpeaker products would be my best option?
I also looked at USB microphones like the Jabra Speak 510. You can get a speaker + microphone in a professional case and hide a Pi somewhere else. But: how well do they work in practice when you are a few meters away?