Newbie build problem

Personally, being completely new to Python, I found building this app to be a nightmare of broken dependencies, as shown above. Hopefully the modular v3 will help with this.
If you want to start learning the language and writing skills, it’s a big overhead.
Docker is a similar story. If you’re new to it, getting it to work and debugging problems is a total PITA. I found learning how to use LXD/LXC containers much more straightforward.

My intention is simply to turn a light on or off in Home Assistant with my voice, without going to the cloud. I don’t intend to use satellites or anything.

@fluidvoice I think v3 is exactly what you’ll want. Only install what you need, and each service has just the bare minimum it needs to run.

@chrismiceli What I’ll be working on at Nabu Casa for the Year of Voice may be of more interest to you. We already have an intent recognizer built into Home Assistant now, and we’re planning to automate the installation of STT/TTS add-ons so it all “just works”. Sort of like Rhasspy Junior, but for more than just English.
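
If you want to poke at that built-in intent recognizer today, here is a minimal sketch that sends plain text to Home Assistant and prints the recognized intent. It assumes the conversation REST API with a long-lived access token; the host, token, and command text are just example values:

```python
# Minimal sketch: send a text command to Home Assistant's built-in
# intent recognizer through the conversation REST API.
# Assumptions: HA reachable at homeassistant.local:8123, a long-lived
# access token, and the conversation integration enabled (it is by default).
import requests

HA_URL = "http://homeassistant.local:8123"
TOKEN = "YOUR_LONG_LIVED_ACCESS_TOKEN"

response = requests.post(
    f"{HA_URL}/api/conversation/process",
    headers={"Authorization": f"Bearer {TOKEN}"},
    json={"text": "turn on the kitchen light", "language": "en"},
    timeout=10,
)
response.raise_for_status()
# The response includes the matched intent and the text HA would speak back
print(response.json())
```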

@synesthesiam Thanks for the reply Mike, and for all that you have done and are doing for the open source and “can’t be evil” (don’t spy on me) speech assistant space. I feel like you’re one of the few technically savvy people fighting the gorillas that have held back speech recognition for so long by hoarding the talent, code, and speech models, i.e. Microsoft, Google, and Amazon, all of which have been acquiring the smaller ASR and TTS companies for over 20 years. Hearing about Alexa losing billions of dollars felt like justified, karma-esque reward. Long term, my hope is that open source speech tech wins in a similar fashion to Linux.

I very much look forward to v3 being released, and will try to help the project however I can.
Cheers, Brad.

@synesthesiam Mike, any idea when even an alpha of v3 will be released?
It would be great if even a single working English STT + TTS config were released so we could test it, hack on it, and write skills… if not also help out in some way.

In my use case I have an Intel Atom system running other services in addition to a Raspberry Pi running HassOS/HA. I see many advantages to NOT running Rhasspy on the RPi/HA server when other physical servers are available. The one downside is that I had to run Rhasspy in a VM running an older version of Debian. I also discovered that some audio hardware that works in Debian does NOT work in a Debian VM. On the upside, when Rhasspy locks up its VM (which it has been doing every week or so), it affects only Rhasspy and none of the other services I’m running.

In the beginning I tried Rhasspy in a Docker container. That failed immediately due to Docker issues, and the Docker documentation I could find was completely inadequate. I find Linux VM technology easier to learn and debug; googling for info was simply more effective.

In my case I use Rhasspy to handle voice command input and TTS output for my music server (Logitech Media Server / Squeezebox). It also handles timers and tells me the time and temperatures.
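
In case it helps anyone wire up something similar, here is a rough sketch (not my exact setup) of how an external service can catch a recognized intent on Rhasspy’s MQTT bus and answer through the TTS endpoint of the HTTP API. The broker port, HTTP port, and the GetTime intent name are just examples:

```python
# Rough sketch: handle a custom Rhasspy intent over MQTT (Hermes protocol)
# and reply through Rhasspy's text-to-speech HTTP endpoint.
# Assumptions: Rhasspy's internal MQTT broker on localhost:12183 (use 1883
# for an external broker), HTTP API on localhost:12101, and a hypothetical
# "GetTime" intent defined in your sentences.
import json
from datetime import datetime

import paho.mqtt.client as mqtt  # paho-mqtt 1.x style client (2.x constructor differs)
import requests

RHASSPY_HTTP = "http://localhost:12101"

def on_connect(client, userdata, flags, rc):
    client.subscribe("hermes/intent/#")  # all recognized intents

def on_message(client, userdata, msg):
    intent = json.loads(msg.payload)
    name = intent["intent"]["intentName"]
    if name == "GetTime":
        text = datetime.now().strftime("It is %H %M")
        # Ask Rhasspy to speak the reply
        requests.post(f"{RHASSPY_HTTP}/api/text-to-speech", data=text, timeout=10)

client = mqtt.Client()
client.on_connect = on_connect
client.on_message = on_message
client.connect("localhost", 12183)
client.loop_forever()
```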

Thanks! It’s been exciting to learn about all the new developments in STT, TTS, and NLU in the last few years, and disappointing at the same time to see just how much of it is behind closed doors. Maybe I’m weird, but I just don’t get the point of companies publishing about things they can’t even share. Just say it runs on magic and don’t tease us :stuck_out_tongue:

I’m hoping this week or next! I’ll start a thread once I’m ready and ping people I know are interested. There’s no GUI or automated installation of services yet, so this will be only for people who want to get their hands dirty.

Were you trying to share a microphone with the Docker container? I found this to be one of the most frustrating experiences of my career. So much so that in v3 I will recommend everyone use a streaming mic input (like GStreamer), even if the mic is on the same machine.
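
Roughly the idea, sketched in Python instead of a gst-launch pipeline. The sounddevice library, destination host, and port here are stand-ins for whatever the voice service is actually configured to listen on:

```python
# Rough equivalent of a "mic -> udpsink" GStreamer pipeline, written with
# the sounddevice library (an illustration, not part of Rhasspy itself).
# Streams raw 16 kHz / 16-bit mono PCM over UDP to an example host/port.
import socket

import sounddevice as sd

DEST = ("127.0.0.1", 12333)  # example listener address; adjust to your setup
sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)

def send_chunk(indata, frames, time, status):
    if status:
        print(status)
    sock.sendto(bytes(indata), DEST)  # raw PCM frames, no header

# 512 frames of 16-bit mono audio = 1024-byte datagrams, well under a 1500-byte MTU
with sd.RawInputStream(samplerate=16000, channels=1, dtype="int16",
                       blocksize=512, callback=send_chunk):
    print("Streaming microphone audio... Ctrl+C to stop")
    while True:
        sd.sleep(1000)
```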

@synesthesiam I tried Rhasspy Junior but ran into some issues.

  1. On WSL, tensorflow doesn’t properly detect the architecture, so tflite_runtime fails to install.
  2. I switched to a native machine, where the install script works but the train script fails. Initially it fails to open a lexicon.db, which seems to be created later in the process. After skipping that error by commenting it out, I get another complaint about missing Kaldi models. I am following the README. Any advice there? Should I create issues in that GitHub repo?
  3. Trying the Docker build, it also fails while trying to find the model files (rhasspy-junior/Dockerfile at master · rhasspy/rhasspy-junior · GitHub).

As I recall, the command to launch the container failed and reported an error about a missing audio device that actually existed. But even before that I was quite frustrated working with the Docker examples. It could have been a personal problem, but while I could find documentation, it often failed to contain the information I needed.

Getting Rhasspy in a VM to use a microphone was a frustrating experience for me as well. Some of it was getting the audio hardware to pass through to the VM (adding the correct edits for hardware passthrough to the VM’s config XML resulted in the config file failing validation), some of it was an audio hardware driver that worked in Debian but failed in a Debian VM, some of it was my usual frustration with ALSA, and some was learning a bit about pyaudio and how to test with it. I think my installation had some unmet dependencies.

My music servers, and the VM for Rhasspy, are built from a fresh install of Debian with no desktop. On the music servers I then install mpd even though I don’t use it. I don’t know exactly what it pulls in, but after installing it my audio setup just seems to work, whereas without mpd I can’t seem to get my audio working the way I want. I could probably write a tutorial covering all the steps to get Rhasspy running in a VM on a headless Debian server, maybe including a section on testing the audio configuration BEFORE trying to get it working in Rhasspy.
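
Something like this is what I mean by testing the audio first: a small pyaudio pre-flight sketch (sample rate, duration, and output filename are just examples) that lists the devices the VM actually sees and grabs a few seconds from the default input:

```python
# Pre-flight audio check before involving Rhasspy: list every capture
# device pyaudio/ALSA can see inside the VM, then record ~3 seconds
# from the default input and save it for playback.
import wave

import pyaudio

pa = pyaudio.PyAudio()

# Show every device the VM exposes, with its input channel count
for i in range(pa.get_device_count()):
    info = pa.get_device_info_by_index(i)
    print(f"{i}: {info['name']} (inputs: {info['maxInputChannels']})")

# Record about 3 seconds of 16 kHz mono audio from the default input device
stream = pa.open(format=pyaudio.paInt16, channels=1, rate=16000,
                 input=True, frames_per_buffer=1024)
frames = [stream.read(1024) for _ in range(int(16000 / 1024 * 3))]
stream.stop_stream()
stream.close()
pa.terminate()

# Write a WAV file so you can check it sounds right: aplay mic_test.wav
with wave.open("mic_test.wav", "wb") as wav:
    wav.setnchannels(1)
    wav.setsampwidth(2)  # 16-bit samples
    wav.setframerate(16000)
    wav.writeframes(b"".join(frames))
```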

I use Rhasspy successfully with FHEM for home automation. I am pleased with the announcement that Rhasspy will remain open to all systems. Please add me to the group of version 3 testers. Greetings, Jens

@GregD @jens-schiffke @chrismiceli @fluidvoice

Thanks, it’s working! :+1:

Most definitely will :eyes: ASAP. Thanks! :love_you_gesture:
