I’ve been following Rhasspy for a few weeks now and I’m pretty impressed by how fast the community is growing. Because it has so much overlap with my own project, I’m trying to think of ways to combine them so they can benefit from each other.
SEPIA is an open-source assistant that does:
- speech-to-text (it even has the same Zamia Speech Kaldi support Rhasspy uses)
- dialog-to-service (skills, actions)
It is also a framework that has some additional features like:
- Java-based core server (lightweight, <100MB)
- Customizable cross-platform clients for browser, Android and iOS (iOS not yet in Apple's App Store)
- User account management
- SDK to build services in Java and upload them to SEPIA (kind of like a skill store, works globally or per user)
- Python bridge to plug Python intent-handling code directly into the SEPIA NLU chain
Since Rhasspy also handles speech-to-text and text-to-intent, I was thinking of either connecting SEPIA services to Rhasspy via SEPIA’s ‘answer’ REST endpoint or implementing Rhasspy’s intent handling in SEPIA via the Python bridge. It would also be great if we could make the STT modules compatible.
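To make the first option a bit more concrete, here is a rough sketch of how a small bridge could forward the text Rhasspy recognized to SEPIA's 'answer' endpoint. Everything specific in it is an assumption for illustration and would need checking against both projects: the server URL and port, the form field names (`text`, `lang`, `KEY`, `client`), the `rhasspy_bridge` client label, and the idea that the Rhasspy event carries the transcription under `text`.

```python
import json
import urllib.parse
import urllib.request

# Assumption: a SEPIA server reachable locally; adjust host, port and path.
SEPIA_ANSWER_URL = "http://localhost:20721/answer"


def build_answer_request(rhasspy_event, user_key, lang="en"):
    """Map a Rhasspy-style intent event to form fields for SEPIA.

    The field names below are assumptions, not a confirmed API contract.
    """
    # Assumed event shape: transcription under "text", else "raw_text".
    text = rhasspy_event.get("text") or rhasspy_event.get("raw_text") or ""
    if not text:
        raise ValueError("event contains no transcription")
    return {
        "text": text,
        "lang": lang,
        "KEY": user_key,             # assumed authentication field
        "client": "rhasspy_bridge",  # hypothetical client label
    }


def ask_sepia(rhasspy_event, user_key, lang="en",
              url=SEPIA_ANSWER_URL, timeout=10):
    """POST the recognized text to SEPIA and return the decoded JSON reply."""
    fields = build_answer_request(rhasspy_event, user_key, lang)
    data = urllib.parse.urlencode(fields).encode("utf-8")
    request = urllib.request.Request(url, data=data, method="POST")
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

A bridge like this could be called from whatever intent-handling hook Rhasspy offers, e.g. `ask_sepia({"text": "what's the weather tomorrow"}, user_key="...")`, with the reply then handed back to Rhasspy for speech output. Keeping the request building separate from the network call makes the mapping easy to adjust once the real field names are confirmed.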
The benefits I see for both systems are:
- Rhasspy could get access to dozens of SEPIA services (weather, navigation, to-do lists, reminders, music, timers, smart home control, news, etc.) and SEPIA’s services SDK
- SEPIA could benefit from Rhasspy’s combination of intent extraction and speech recognition
- Rhasspy could be accessed via SEPIA’s web-based clients
Let me know what you think.