Rhasspy does currently not work very well with Swedish, that is why I want to use Google STT with Rhasspy to make custom voice commands. There is currently no way to use custom voice commands with Google Home, IFTTT or other services in Swedish.
Rhasspy uses 16-bit 16 kHz mono, so have to configure that in the RecognitionConfig and AudioEncoding
The program shoud process the SpeechRecognitionResult and print the text transcription to standard out.
This is the path I would follow if I wanted to do this, but have not tried it.
The remote Rhasspy server does not work, because you need specify with settings of the audio.
Thanks for helping me, I appreciate it. I’ve downloaded speech2text.sh and set Speech To Text to “command”, pointing to the file. But I’m not sure how to configure RecognitionConfig and AudioEncoding in the program. Should I just copy the JSON containing RecognitionConfig and AudioEncoding into the speech2text.sh?
Oh. I’m not really good with coding, so that won’t work…
Well, thanks anyway.
If you or anyone else could make a custom speech2text.sh that works with Google STT and Rhasspy, I would really appreciate it. It would mean a lot
Sorry for revive this post. I would like to use Google for speech to text. I think that the simple solution for me its copy the python code from Google example (I have de credentials, virtual enviorement…), but i don’t know wich file I must to send to transcrip (the .wav where the Rhasspy save the sound).
I want to change STT to google STT = I want to call google via my own python script.
The problem is that I´m not able to understand how should I correctly write path where is my script stored.
My python script is stored in: /home/mirek/.config/rhasspy/profiles/en/stt.py
I tried put exactly this path to setting, but it does not work.
You always must take a look the path’s inside of the continer. Inside the container the profiles path isn’t .config/rhasspy/profiles, it’s /profiles, then the path you must put in Rhasspy configuration should be /profiles/en/stt.py
These project are archived and haven’t mantenance. I’m sure works 3 years a go but now I found some errors, specially in python enviorement. I upload my own version (works for me) GitHub - naudor/rhasspy_Google_STT: Rhasspy with a addon for use Google Speech to Text .
It’s my first git docker image, then maybe I have forgot something or did something wrong. It’s a image for arm64, but if you are working in another system you only need change the Dockerfile to refer another Rhasspy docker image.
I must advice that I only found a “easy” solution to work with Rhasspy. I try some microphones but was impossible to set well in Rhasspy, because they had bad quality and Rhasspy it not know when you finish the voice command (maybe will improve with omnidirectional microphone). If you get better results, please share the configuration parameters ;-).
Finally the only solution was install the Rhasspy Android client in my cellphone. This way works perfect, better than Google Home for example, but you need to acces to the app and click to start to talk and another click to close the voice command.