Detect End if Speech

I somehow ran into the problem that I needed to detect when the person had finished talking using rhasspy, but to do that I had to use listen-for-command to make the recording end when the person was silent.

Can I just somehow start recording audio and end when the person is silent, for example with start-recording.
Since listen-for-command tries to understand what kind of intent it is and causes a bunch of logic I don’t need, I just need to record the whole speech from the beginning of speech to silence and get it somehow. How is this possible to do?