Add support for additional TTS integrations through non-Microsoft focused SpeechService interface

EDDI currently uses whichever built-in Windows TTS voices are installed. Unfortunately, the built-in Windows voices are not particularly good.

This feature request asks for a more modular SpeechService class that lets other speech engines plug in. Such engines would not rely on the Windows TTS interfaces, but would provide the same kind of WAV stream that the existing class uses.
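To make the request concrete, here is a minimal sketch of what such a plug-in contract could look like. It is written in Python purely for illustration (EDDI itself is C#), and every name in it (`SpeechEngine`, `SilenceEngine`, `SpeechService.register`, `SpeechService.speak`) is hypothetical rather than part of EDDI's existing code. The only assumption carried over from the request is that each engine hands back a complete WAV stream.

```python
import io
import wave
from abc import ABC, abstractmethod


class SpeechEngine(ABC):
    """Hypothetical plug-in contract: each engine returns a complete WAV file as bytes."""

    name: str

    @abstractmethod
    def synthesize(self, text: str, voice: str) -> bytes:
        """Return WAV-encoded audio for the given text."""


class SilenceEngine(SpeechEngine):
    """Stand-in engine that emits silence.

    A real plug-in would call SAPI, a cloud TTS service, etc. here instead.
    """

    name = "silence"

    def __init__(self, sample_rate: int = 16000, frames_per_char: int = 800):
        self.sample_rate = sample_rate
        self.frames_per_char = frames_per_char

    def synthesize(self, text: str, voice: str) -> bytes:
        n_frames = len(text) * self.frames_per_char
        buf = io.BytesIO()
        with wave.open(buf, "wb") as wav:
            wav.setnchannels(1)       # mono
            wav.setsampwidth(2)       # 16-bit samples
            wav.setframerate(self.sample_rate)
            wav.writeframes(b"\x00\x00" * n_frames)
        return buf.getvalue()


class SpeechService:
    """Hypothetical engine-agnostic front end: callers only ever see WAV bytes."""

    def __init__(self):
        self._engines = {}

    def register(self, engine: SpeechEngine) -> None:
        self._engines[engine.name] = engine

    def speak(self, text: str, voice: str, engine_name: str) -> bytes:
        try:
            engine = self._engines[engine_name]
        except KeyError:
            raise ValueError(f"no speech engine registered as {engine_name!r}")
        return engine.synthesize(text, voice)
```

With this shape, the rest of the application would call `speak(...)` and process the returned WAV data exactly as it does today, regardless of which engine produced it. Adding a new engine would mean writing one class and registering it.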

Examples of other engines could include (but are not limited to):

- Amazon Polly - https://ai-service-demos.go-aws.com/polly
- Google - https://cloud.google.com/text-to-speech
- Microsoft Azure - https://azure.microsoft.com/en-us/services/cognitive-services/text-to-speech/
- Different versions of the SAPI interface

As a proof of concept, here is an Amazon Polly implementation I created:

https://gist.github.com/druggedhippo/0a887973ee019dea1fc9e522f513b0f5

Example audio of Amazon Polly processing an EDDI TTS prompt in real time:

https://imgur.com/zyoWmQg


Add support for additional TTS integrations through non-Microsoft focused SpeechService interface #2379
