DTLN: Realtime Machine Learning based Noise Suppression / AEC on Raspberry Pi

rolyan_trauts · June 3, 2021, 7:23pm

@fastjack you seem to have both sample-rate (SR) versions, whereas I presume you need just one and should choose between 48k and 16k.

Depending on your hardware, many cards will only do 44.1k or 48k via hw:, but I presume you can use plughw: so that it does automatic conversion. 16k sometimes isn’t supported via hw: directly.

I often use https://www.volkerschatz.com/noise/alsacap.tgz rather than hunting around with cat /proc/asound/cards.

fastjack · June 3, 2021, 8:15pm

Thanks @rolyan_trauts

sanebow · June 4, 2021, 8:05am

Nice spot, @rolyan_trauts. alsacap is a nice tool, and its author also has a nice article demystifying ALSA, which helped me a lot.

It’s quite strange that when using the multi plugin we can’t use plug to wrap hardware devices for automatic rate conversion; it results in xruns. We have to use a rate plugin to set the target rate explicitly. Maybe it’s a bug in ALSA?

I remember you have an Arch install running the latest ALSA. At your convenience, please help test whether this bug has been fixed in more recent ALSA versions.

rolyan_trauts · June 4, 2021, 10:25am

Will try, but I’ve been a bit busy with some house DIY, and boy, the pollen count has been setting off my hay fever and making me even more useless than normal.

I have a gut feeling that with the multi plugin it may be a no for plughw:, like you say, and that you have to use rate explicitly.

Alsa can be so much fun

fastjack · June 4, 2021, 1:40pm

Hey all!

Still no luck with python3 aec.py -m 128 -i aec_internal -o aec_internal

sounddevice.PortAudioError: Error opening InputStream: Unanticipated host error [PaErrorCode -9999]: 'Invalid argument' [ALSA error -22]

I can record and play correctly from/to aec_internal using ALSA arecord and aplay. Same for aec.

arecord -D aec_internal -f S16_LE -r 16000 -c 2 rec.wav -V stereo
Recording WAVE 'rec.wav' : Signed 16 bit Little Endian, Rate 16000 Hz, Stereo
             +#################### 62%|00%+

aplay -D aec_internal -r 16000 test.wav -V mono
Playing WAVE 'test.wav' : Signed 16 bit Little Endian, Rate 16000 Hz, Mono
#############                                    + | 97%

Any idea?

Note: I’m using Python 3.7.3 on Raspbian GNU/Linux 10 (buster)
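
One way to narrow down an error like this is to ask PortAudio what it can see before a stream is opened. The sketch below is not part of aec.py: `find_devices` and `probe` are hypothetical helper names, and it assumes the `sounddevice` package is installed. It relies only on `sd.query_devices()` and `sd.check_input_settings()`; the 16000 Hz and 2-channel defaults are copied from the arecord command above.

```python
def find_devices(devices, name):
    """Return (index, info) pairs whose PortAudio device name contains `name`."""
    matches = []
    for index, info in enumerate(devices):
        if name.lower() in info["name"].lower():
            matches.append((index, info))
    return matches


def probe(name, samplerate=16000, channels=2):
    """Report whether PortAudio accepts the given input settings for `name`.

    sounddevice is imported lazily so find_devices() stays usable without it.
    """
    import sounddevice as sd

    for index, info in find_devices(list(sd.query_devices()), name):
        try:
            sd.check_input_settings(
                device=index, channels=channels, samplerate=samplerate
            )
            print(index, info["name"], "OK")
        except Exception as exc:  # e.g. sd.PortAudioError for rejected settings
            print(index, info["name"], "rejected:", exc)
```

Calling `probe("aec_internal")` on the Pi would print one line per matching device. If nothing is printed, PortAudio did not list a device with that name at all, which is a different problem from a rejected rate or channel count.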

rolyan_trauts · June 4, 2021, 2:27pm

Post your /etc/asound.conf and the output of aplay -l and arecord -l and I will try to have a look, as it seems to be just ALSA.

fastjack · June 4, 2021, 2:31pm

Here you go:

asound.conf

defaults.pcm.aec.playback_hw.card "1"
defaults.pcm.aec.playback_hw.rate 48000

defaults.pcm.aec.capture_hw.card "1"
defaults.pcm.aec.capture_hw.rate 48000

aplay -l

**** List of PLAYBACK Hardware Devices ****
card 0: Headphones [bcm2835 Headphones], device 0: bcm2835 Headphones [bcm2835 Headphones]
  Subdevices: 8/8
  Subdevice #0: subdevice #0
  Subdevice #1: subdevice #1
  Subdevice #2: subdevice #2
  Subdevice #3: subdevice #3
  Subdevice #4: subdevice #4
  Subdevice #5: subdevice #5
  Subdevice #6: subdevice #6
  Subdevice #7: subdevice #7
card 1: Device [USB PnP Sound Device], device 0: USB Audio [USB Audio]
  Subdevices: 1/1
  Subdevice #0: subdevice #0
card 2: Loopback [Loopback], device 0: Loopback PCM [Loopback PCM]
  Subdevices: 8/8
  Subdevice #0: subdevice #0
  Subdevice #1: subdevice #1
  Subdevice #2: subdevice #2
  Subdevice #3: subdevice #3
  Subdevice #4: subdevice #4
  Subdevice #5: subdevice #5
  Subdevice #6: subdevice #6
  Subdevice #7: subdevice #7
card 2: Loopback [Loopback], device 1: Loopback PCM [Loopback PCM]
  Subdevices: 8/8
  Subdevice #0: subdevice #0
  Subdevice #1: subdevice #1
  Subdevice #2: subdevice #2
  Subdevice #3: subdevice #3
  Subdevice #4: subdevice #4
  Subdevice #5: subdevice #5
  Subdevice #6: subdevice #6
  Subdevice #7: subdevice #7

arecord -l

**** List of CAPTURE Hardware Devices ****
card 1: Device [USB PnP Sound Device], device 0: USB Audio [USB Audio]
  Subdevices: 1/1
  Subdevice #0: subdevice #0
card 2: Loopback [Loopback], device 0: Loopback PCM [Loopback PCM]
  Subdevices: 8/8
  Subdevice #0: subdevice #0
  Subdevice #1: subdevice #1
  Subdevice #2: subdevice #2
  Subdevice #3: subdevice #3
  Subdevice #4: subdevice #4
  Subdevice #5: subdevice #5
  Subdevice #6: subdevice #6
  Subdevice #7: subdevice #7
card 2: Loopback [Loopback], device 1: Loopback PCM [Loopback PCM]
  Subdevices: 8/8
  Subdevice #0: subdevice #0
  Subdevice #1: subdevice #1
  Subdevice #2: subdevice #2
  Subdevice #3: subdevice #3
  Subdevice #4: subdevice #4
  Subdevice #5: subdevice #5
  Subdevice #6: subdevice #6
  Subdevice #7: subdevice #7
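
For anyone comparing listings like the ones above against the card numbers in asound.conf, here is a minimal sketch that turns `aplay -l` / `arecord -l` output into `hw:CARD,DEV` strings. `parse_listing` is a hypothetical helper, and the regular expression only assumes the `card N: ID [NAME], device M:` line shape visible in this post.

```python
import re

# Matches lines such as:
# card 1: Device [USB PnP Sound Device], device 0: USB Audio [USB Audio]
CARD_LINE = re.compile(r"^card (\d+): (\S+) \[(.*?)\], device (\d+): ")


def parse_listing(text):
    """Return one dict per 'card N: ..., device M: ...' line in the listing."""
    found = []
    for line in text.splitlines():
        match = CARD_LINE.match(line)
        if match:
            card, card_id, card_name, device = match.groups()
            found.append({
                "card": int(card),
                "id": card_id,
                "name": card_name,
                "device": int(device),
                "hw": "hw:{},{}".format(card, device),
            })
    return found


# A few lines copied from the aplay -l output above.
sample = """\
card 0: Headphones [bcm2835 Headphones], device 0: bcm2835 Headphones [bcm2835 Headphones]
  Subdevices: 8/8
card 1: Device [USB PnP Sound Device], device 0: USB Audio [USB Audio]
card 2: Loopback [Loopback], device 1: Loopback PCM [Loopback PCM]
"""

for entry in parse_listing(sample):
    print(entry["hw"], entry["name"])
```

With the sample lines this prints `hw:0,0 bcm2835 Headphones`, `hw:1,0 USB PnP Sound Device` and `hw:2,1 Loopback`, so card "1" in the asound.conf above is the USB device for both playback and capture.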

sanebow · June 4, 2021, 2:32pm

I may have seen the same error when I was on the first version of my ALSA config. I thought the current version had fixed it, but obviously it didn’t. I am not sure what causes the error, but you may try the following:
Comment out these two lines under both playback_hw and capture_hw in alsa-aec.conf:

        period_time 8000
        periods 32

If it still doesn’t work, try to replace them with this:

   period_time 0
   period_size 128     # you may change this to different values like 1024 to test
   buffer_size 4096

Finally, if none of that works, try testing with a different sound card.

sanebow · June 4, 2021, 2:35pm

Forgot to mention. Are you also running PulseAudio? If so you may stop it during testing:
systemctl --global stop pulseaudio

fastjack · June 4, 2021, 2:43pm

I’ve changed the lines in 50-aec.conf but with the changes, arecord crashes.

arecord -D aec_internal -f S16_LE -r 16000 -c 2 rec.wav -V stereo -v
Recording WAVE 'rec.wav' : Signed 16 bit Little Endian, Rate 16000 Hz, Stereo
arecord: set_params:1310: Broken configuration for this PCM: no configurations available

Without the changes, arecord works.

arecord -D aec_internal -f S16_LE -r 16000 -c 2 rec.wav -V stereo -v
Recording WAVE 'rec.wav' : Signed 16 bit Little Endian, Rate 16000 Hz, Stereo
Route conversion PCM
  Transformation table:
    0 <- 0
    1 <- 1
Its setup is:
  stream       : CAPTURE
  access       : RW_INTERLEAVED
  format       : S16_LE
  subformat    : STD
  channels     : 2
  rate         : 16000
  exact rate   : 16000 (16000/1)
  msbits       : 16
  buffer_size  : 4096
  period_size  : 128
  period_time  : 8000
  tstamp_mode  : NONE
  tstamp_type  : MONOTONIC
  period_step  : 1
  avail_min    : 128
  period_event : 0
  start_threshold  : 1
  stop_threshold   : 4096
  silence_threshold: 0
  silence_size : 0
  boundary     : 536870912
Slave: Multi PCM
  Channel bindings:
    0: slave 0, channel 0
    1: slave 1, channel 0
Its setup is:
  stream       : CAPTURE
  access       : MMAP_COMPLEX
  format       : S16_LE
  subformat    : STD
  channels     : 2
  rate         : 16000
  exact rate   : 16000 (16000/1)
  msbits       : 16
  buffer_size  : 4096
  period_size  : 128
  period_time  : 8000
  tstamp_mode  : NONE
  tstamp_type  : MONOTONIC
  period_step  : 1
  avail_min    : 128
  period_event : 0
  start_threshold  : 1
  stop_threshold   : 4096
  silence_threshold: 0
  silence_size : 0
  boundary     : 536870912
Slave #0: Rate conversion PCM (48000)
Converter: libspeex (external)
Protocol version: 10002
Its setup is:
  stream       : CAPTURE
  access       : MMAP_INTERLEAVED
  format       : S16_LE
  subformat    : STD
  channels     : 1
  rate         : 16000
  exact rate   : 16000 (16000/1)
  msbits       : 16
  buffer_size  : 4096
  period_size  : 128
  period_time  : 8000
  tstamp_mode  : NONE
  tstamp_type  : MONOTONIC
  period_step  : 1
  avail_min    : 128
  period_event : 0
  start_threshold  : 1
  stop_threshold   : 4096
  silence_threshold: 0
  silence_size : 0
  boundary     : 536870912
Slave: Direct Snoop PCM
Its setup is:
  stream       : CAPTURE
  access       : MMAP_INTERLEAVED
  format       : S16_LE
  subformat    : STD
  channels     : 1
  rate         : 48000
  exact rate   : 48000 (48000/1)
  msbits       : 16
  buffer_size  : 12288
  period_size  : 384
  period_time  : 8000
  tstamp_mode  : NONE
  tstamp_type  : MONOTONIC
  period_step  : 1
  avail_min    : 384
  period_event : 0
  start_threshold  : 3
  stop_threshold   : 12288
  silence_threshold: 0
  silence_size : 0
  boundary     : 1610612736
Hardware PCM card 1 'USB PnP Sound Device' device 0 subdevice 0
Its setup is:
  stream       : CAPTURE
  access       : MMAP_INTERLEAVED
  format       : S16_LE
  subformat    : STD
  channels     : 1
  rate         : 48000
  exact rate   : 48000 (48000/1)
  msbits       : 16
  buffer_size  : 12288
  period_size  : 384
  period_time  : 8000
  tstamp_mode  : ENABLE
  tstamp_type  : MONOTONIC
  period_step  : 1
  avail_min    : 384
  period_event : 0
  start_threshold  : 1
  stop_threshold   : 1610612736
  silence_threshold: 0
  silence_size : 0
  boundary     : 1610612736
  appl_ptr     : 0
  hw_ptr       : 0
Slave #1: Hardware PCM card 2 'Loopback' device 1 subdevice 4
Its setup is:
  stream       : CAPTURE
  access       : MMAP_INTERLEAVED
  format       : S16_LE
  subformat    : STD
  channels     : 1
  rate         : 16000
  exact rate   : 16000 (16000/1)
  msbits       : 16
  buffer_size  : 4096
  period_size  : 128
  period_time  : 8000
  tstamp_mode  : NONE
  tstamp_type  : MONOTONIC
  period_step  : 1
  avail_min    : 128
  period_event : 0
  start_threshold  : 1
  stop_threshold   : 4096
  silence_threshold: 0
  silence_size : 0
  boundary     : 536870912
  appl_ptr     : 0
  hw_ptr       : 0
                            +  ### 19%|00%+
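
The period and buffer sizes in this dump follow from the `period_time 8000` and `periods 32` settings quoted earlier in the thread. The arithmetic can be sketched as below; the helper names are made up for illustration, and the only assumption is that period_time is given in microseconds, which is consistent with the dump.

```python
def frames_per_period(rate_hz, period_time_us):
    """Frames in one period: sample rate (frames/s) times period length (s)."""
    return rate_hz * period_time_us // 1_000_000


def buffer_frames(period_frames, periods):
    """Total buffer size in frames for a given number of periods."""
    return period_frames * periods


# 16000 Hz is the rate seen by the application, 48000 Hz the hardware rate.
for rate in (16000, 48000):
    period = frames_per_period(rate, 8000)
    print(rate, period, buffer_frames(period, 32))
```

This prints `16000 128 4096` and `48000 384 12288`, matching the period_size and buffer_size values reported for the 16000 Hz layers and for the 48000 Hz Direct Snoop and hardware layers.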

sanebow · June 4, 2021, 2:44pm

Wait… I just reproduced your error. I noticed that I have a USB sound card also named “Device”, which may be the same as yours. I took it out and tested with your config, and the same error was there. Let me do some testing on my side and I will let you know if there’s a fix.

rolyan_trauts · June 4, 2021, 2:52pm

Should the default capture card now be the loopback, since the chain is now usb->capture->aec->loopback, and not the USB card?

sanebow · June 4, 2021, 4:09pm

@fastjack This error seems to be related to some unknown PortAudio bug. I now have a workaround and have just uploaded a new version. Please test with this latest version (v0.3):

https://github.com/SaneBow/alsa-aec/blob/main/alsa-aec.conf

### ALSA virtual device which combines recording and playback loopback for AEC processing
# https://github.com/sanebow/alsa-aec
# sanebow@gmail.com
# Version: 0.3 (20210506)

defaults.pcm.aec.playback_hw.card defaults.pcm.card   # playback hw card 
defaults.pcm.aec.playback_hw.rate 48000               # sample rate supported by the playback card
defaults.pcm.aec.capture_hw.card defaults.pcm.card    # capture hw card
defaults.pcm.aec.capture_hw.rate 16000                # sample rate supported by the capture card
## you may also directly set some PCM device as playback and capture device
defaults.pcm.aec.playback_pcm "playback_hw"      
defaults.pcm.aec.capture_pcm "capture_hw"             
defaults.pcm.aec.pre_loopidx 4    # loopback subdevice index for stream before aec processing
defaults.pcm.aec.post_loopidx 5   # loopback subdevice index for stream after aec processing

pcm.playback_hw {
  @args [ CARD RATE ]
  @args.CARD { 
    type string 
    default {

This file has been truncated.

fastjack · June 4, 2021, 4:36pm

The fix works! Thanks!

On with the AEC testing now

ChrizZz · December 18, 2021, 12:38pm

Guys, can you explain how you configure Rhasspy in this case with NS & AEC? Do I understand correctly that each satellite needs to do NS and AEC, or is it also possible to do this just on the base system?

In another topic I asked about ways to improve Rhasspy’s wake word detection and reduce the failure rate, and @romkabouter advised me to search for NS & AEC. Based on the search results, this looks like the most advanced and up-to-date approach.

APetrycki · December 22, 2021, 7:00pm

I’m trying to get this to work. I’ve gotten as far as python3 ns.py -o 'Loopback ,0' --measure and it segfaults. I’m running a Pi 4 with 64-bit Pi OS and a ReSpeaker 2-Mic; I copied over the aec config and added snd-aloop to /etc/modules. I was originally trying to get ec to work, but had issues after switching to the 64-bit version of Pi OS, so maybe 64-bit is my problem? Has anyone tested on 64-bit?

sanebow · December 23, 2021, 2:23am

I haven’t tried a base/satellite setup myself. But these machine-learning-based processors require a more powerful CPU (preferably an RPi 3+ or 4).

sanebow · December 23, 2021, 2:25am

I’ve tested on a 64-bit OS and didn’t encounter this problem.

ChrizZz · December 23, 2021, 10:33am

What does the flow look like right now? You apply NS and AEC before Rhasspy (and its modules) starts working, right?

I think it has to be done on the system the microphone is connected to, because if you do it on the base station, that station doesn’t know the audio stream of the satellite systems.

What do your audio recording settings in Rhasspy look like? I guess you need to change the device to avoid grabbing the raw microphone line?

sanebow · December 24, 2021, 7:06am

The NS/AEC scripts are standalone. You can set up a virtual sound card with ALSA and then configure Rhasspy to use it. Maybe it’s easier to set up on satellites. I don’t use Rhasspy; hopefully somebody else has tried to configure it and can give you some help.