Mimic 3 Text to Speech Launch!

synesthesiam · June 29, 2022, 12:48pm

Hi everyone! Today is the day that Mimic 3 is officially launched.

The official docs talk about all the different ways you can try it for yourself.

I will be making an update to Rhasspy soon to add Mimic 3 as a TTS option, which I would recommend over Larynx now. Especially for the Pi 3 and 4, I’d say this is a big win for open source offline TTS!

Let me know what you think, and have a good day.

paddy0174 · June 30, 2022, 11:12am

This is what I think: Awesome

hugocoolens · July 3, 2022, 9:36am

This would be a great addition to Rhasspy!

rejoe2 · July 4, 2022, 5:37pm

Glad to hear about plans for an update for Rhasspy.

I recently also did some testing with Mimic 3 as a drop-in replacement for MaryTTS and can now confirm that this method works flawlessly.

May I kindly ask whether there are plans to integrate the most recent version of the “Thorsten” dataset into Mimic 3 as well? (Thorsten-22.05-neutral)

synesthesiam · July 5, 2022, 3:16pm

Definitely

AndreKR · July 5, 2022, 5:03pm

And don’t forget harvard-glow_tts please.

rejoe2 · July 10, 2022, 6:44am

Seems I didn’t get deep enough into all the details…?
As far as I have now found out, the “thorsten_low” voice is already based on the (preliminary) version now named 22.05-neutral. So most likely no specific action is required at the moment.

Jarvy · July 14, 2022, 2:54am

Glad to hear it’s coming to Rhasspy! I will probably put some effort into getting it added to Home Intent for 64-bit Pi OSes. The bit of experimenting I’ve done has been incredibly promising; the real-time factor is really good. Thanks for putting it out there!

kaykoch · July 14, 2022, 4:01pm

I installed Mimic 3 as a Docker image (before that, I was using Google WaveNet):
https://mycroft-ai.gitbook.io/docs/mycroft-technologies/mimic-tts/mimic-3

After that, on my Rhasspy server (I have a server/satellite setup), I changed:
Preferences → Text to Speech to MaryTTS
After saving and restarting, I changed the
URL to http://10.2.254.251:59125/process (that’s the IP of the Docker host).
After saving and restarting again, I refreshed the available voices.
Now I could pick my language/voice.

This works!
Great!! Thank you.
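
For anyone who wants to check the same endpoint outside of Rhasspy, here is a minimal Python sketch. It assumes that the Mimic 3 server accepts the usual MaryTTS query parameters (INPUT_TEXT and VOICE) on /process, which is what the MaryTTS option relies on, and that the response body is the audio data. The helper names and the voice key "de_DE/thorsten_low" are only examples, so adjust them to your setup.

```python
from urllib.parse import urlencode
from urllib.request import urlopen


def build_process_url(base_url: str, text: str, voice: str) -> str:
    """Build a MaryTTS-style /process URL (hypothetical helper).

    Assumption: the server reads the text from INPUT_TEXT and the
    voice from VOICE, as a MaryTTS server does.
    """
    query = urlencode({"INPUT_TEXT": text, "VOICE": voice})
    return f"{base_url.rstrip('/')}/process?{query}"


def synthesize(base_url: str, text: str, voice: str, timeout: float = 30.0) -> bytes:
    """Request speech from the server and return the raw response bytes."""
    with urlopen(build_process_url(base_url, text, voice), timeout=timeout) as response:
        return response.read()


# Example usage (needs a running server, so it is left commented out):
# audio = synthesize("http://10.2.254.251:59125", "Hallo Welt.", "de_DE/thorsten_low")
# with open("test.wav", "wb") as wav_file:
#     wav_file.write(audio)
```

If this returns playable audio, the URL entered in the Rhasspy settings should work as well.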

Problems with the German language:
“.” vs. “,”:
11.00 → Elf tausend (eleven thousand)
11,00 → Elf komma Null (eleven point zero)
(So I added a replace command in Home Assistant for temperature values.)
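
The replace itself was done in Home Assistant; the same idea as a small Python sketch, in case the text is prepared in a script instead. The function name is made up for illustration.

```python
def format_german_decimal(value: float, digits: int = 1) -> str:
    """Format a number with a decimal comma so a German voice reads it as a decimal.

    Hypothetical helper: it formats with a fixed number of decimals and
    then swaps the decimal point for a comma.
    """
    return f"{value:.{digits}f}".replace(".", ",")


# Example: build a sentence with a temperature value before sending it to TTS.
sentence = f"Die Temperatur beträgt {format_german_decimal(21.5)} Grad."
```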

Time:
11:30 → elf dreißig (eleven thirty, wrong)
11:30h → elf Uhr dreißig (right)
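
A possible workaround, sketched in Python: append the “h” to bare clock times before the text goes to TTS. This only relies on the behaviour observed above (11:30h is read correctly); the function name and the regular expression are my own and not part of Mimic 3.

```python
import re

# Matches H:MM or HH:MM that is not part of a longer number or time
# (such as 11:30:45) and is not already followed by "h" or "Uhr".
TIME_PATTERN = re.compile(r"(?<![\d:])(\d{1,2}:\d{2})(?![\d:hH]|\s*Uhr)")


def add_time_suffix(text: str) -> str:
    """Append 'h' to bare clock times so they are read as times (hypothetical helper)."""
    return TIME_PATTERN.sub(r"\1h", text)
```

Times that already carry “h” or “Uhr” are left unchanged, so the function can be applied to the same text more than once.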

Beginning and end of sentences:
If Rhasspy speaks a sentence, words are missing at the beginning and at the end.
With Home Assistant, there is a strange sound at the end if you finish the sentence with a period (.), but no words are missing.

APetrycki · July 26, 2022, 12:44pm

I see it supports the use of CUDA acceleration…how difficult would it be to support Google Coral?

synesthesiam · July 29, 2022, 2:58am

It depends on the kind of networks that Coral supports. Last I checked, they don’t support transformer networks.

itsMattShull · September 23, 2022, 2:47am

Excited to see this implemented into Rhasspy!

KiboOst · October 3, 2022, 4:01pm

Nice to see interest in a future for Rhasspy!

Mimic 3 seems very good, but currently there are some obvious problems with the French pronunciation.

kajelo · October 6, 2022, 4:12pm

Agreed, excited to see it implemented in Rhasspy, but there are some problems with French pronunciation. Examples: ‘2’, ‘2022’
Both examples are (always) pronounced in liaison /dœ.z‿/, but should be /dø/ here.

Damn · November 17, 2023, 12:15pm

uhh sorry didn’t realize this is an old thread…