Android engine/engines for TTS and STT/ASR #1392
paolo-caroni
started this conversation in
Ideas
Replies: 1 comment 12 replies
-
WhT you think about it? |
Beta Was this translation helpful? Give feedback.
12 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I am writing here because I think the topic is more suitable for discussions than "issues". As requested by several people (see #852 #569 #1088 #569 ), sherpa-onnx would need an engine app for TTS (and a separate or merged one for STT/ASR #580 ). Android offers a standard API for TTS that is currently partially implemented by android .apk packages that support one language at a time and that do not allow the use of multilingual apps (or change language easy).
To save space on devices and implement the choice of language, the engine app should be a single one and allow the language models to be installed separately (without being already present in the package).
An attempt to start the work in this direction was started by @jing332 at the repository but the development stopped at the initial stage.
I would like to understand how I can help in the development of a stable and working app that can reach the level of integration at least equal to eSpeak (I would hope even higher).
Could a fundraiser be useful?
As for the STT/ASR engine the situation is worse. Currently the API is not supported at all. Language detection support DETECTED_LANGUAGE is possible, as it is currently done by a spoken language identification apk that doesn't seem to interface with the API.
Sherpa-onnx seems to have all the capabilities to support language detection, multi-language support, confidence level and more.
Regarding merging or keeping separate TTS and STT, the models are different, so for the same language you would need a model for TTS and a model for STT, so it would probably be easier to have separate engines.
As pointed out here the purpose of the android packages does not seem to be to offer an engine that supports the native android API, but rather to provide examples on how to directly integrate sherpa-onnx in each app with a different API. This would require, if I understood correctly, that each app that listens or spoke to the user should integrate sherpa-onnx, which for obvious reasons makes no sense on Android platform.
Again, I would not be able to write useful code, but I could think about participating in a crowdfunding.
You who have been interested in the topic previously, what do you think?
@mablue @jing332 @csukuangfj @thewh1teagle @domasofan @gyroing
Beta Was this translation helpful? Give feedback.
All reactions