TTS - Text to Speech
On this page |
|
What’s about
In this section, you find the blocks which can be used for text-to-speech conversion.
GoogleCloudTTS
This box lets you perform a Text-To-Speech conversion using the Google Cloud TTS*
Explore this documentation to find out How to retrieve Google Key for Cally Square blocks
An Internet connection is required for this box to work
Label: here you can type a brief description
Provider: possibility to configure Google as a Cloud Provider
Text Type: (mandatory) The available choices are Text (PlainText) or SSML (Speech Synthesis Markup Language: refer to Google official documentation to find out more about how to use it)
Text: the text you want to be translatedÂ
Language Code: the language you want to use for the translation
Voice Type (choosing one of the proposed choices like News, Standard, Studio, Polyglot…)
Voice Name: the voice of the operator (choose between Female/Male and voice type)
Speaker Type (so audio device profile like smartphones, headphones, car speakers, IVR…)
Speed, so speaking rate
Pitch to adjust the intonation of the generated voice
Exit Arrows
This box provides just one arrow out to the next step
Â
GoogleTTS
This box lets you perform a Text-To-Speech conversion using the Google TTS*.
An Internet connection is required for this box to work
Warning
We do not recommend using this block in production but only for testing. We recommend using GoogleCloudTTS instead of GoogleTTS.
Â
Label:Â here you can type a brief description
Text:Â the text you want to be translated (the maximum text length allowed is 200 characters)
Language: the language you want to use for the translation (i.e. it, en, etc..)
Exit Arrows
This box provides just one arrow out to the next step
ISpeechTTS
This box lets you perform a Text-To-Speech conversion using the Ispeech TTS Agi* Parameters
Label:Â here you can type a brief description
Text:Â the text you want to be translatedÂ
Key: your acquired license key from the ispeech.org account
Language: the language you want to use for the translation (i.e. it, en, etc..)
Speed
Interrupt key
Exit Arrows
This box provides just one arrow out to the next step.
AWSPolly
This box enables a Text-To-Speech conversion function using AWS Polly Agi Parameters*.
For additional information see https://aws.amazon.com/polly/
Label: here you can type a brief description
Access Key ID and Secret Access Key: AWS security credentials. Required: YesÂ
(see http://docs.aws.amazon.com/sdk-for-java/v1/developer-guide/credentials.html )Region: AWS regional endpoint. Required: Yes (see http://docs.aws.amazon.com/general/latest/gr/rande.html#pol_region)
Voice: the voice used for the synthesis. Required: Yes
Text: input text to synthetize. Required: Yes
Text Type: specifies whether the input text is plain text or SSML. The default value is plain text (see https://docs.aws.amazon.com/polly/latest/dg/ssml.html )
Exit Arrows
This box provides just one arrow out to the next step
LumenvoxTTS
This box lets you perform a Text-To-Speech conversion using the Lumenvox TTS
Â
Label:Â here you can type a brief description
Text:Â the text you want to be translatedÂ
Options: they control details about the synthesis. Valid options are:
l - language to use (e.g. "en-GB", "en-US", "en-AU", etc.)
v - voice name to use (e.g. "Lindsey", "Chris", etc.)
g - voice gender to use (e.g. "male", "female")
p - profile to use, as specified in the mrcp.conf file
i - digits to allow the TTS to be interrupted with (can specify "any" to allow any digits to interrupt)
f - filename on disk to store audio to (audio not stored if not specified or empty)
epe – exit on a play error
pv - prosody volume (silent/x-soft/soft/medium/load/x-loud/default)
pr - prosody rate (x-slow/slow/medium/fast/x-fast/default)
You are not required to supply any options. Multiple options can be provided by joining options with an ampersand, e.g. l=en-US&g=female
Exit Arrows
This box provides just one arrow out to the next step
SestekTTS
available from rel. 2.0.84
This box lets you perform a Text-To-Speech conversion using the Sestek TTS.
Â
Label:Â here you can type a brief description
Text:Â the text you want to be translatedÂ
Options: they control details about the synthesis. Valid options are:
l - language to use (e.g. "en-GB", "en-US", "en-AU", etc.)
v - voice name to use (e.g. "Lindsey", "Chris", etc.)
g - voice gender to use (e.g. "male", "female")
p - profile to use, as specified in the mrcp.conf file
i - digits to allow the TTS to be interrupted with (can specify "any" to allow any digits to interrupt)
f - filename on disk to store audio to (audio not stored if not specified or empty)
epe – exit on a play error
pv - prosody volume (silent/x-soft/soft/medium/load/x-loud/default)
pr - prosody rate (x-slow/slow/medium/fast/x-fast/default)
You are not required to supply any options. Multiple options can be provided by joining options with an ampersand, e.g. l=en-US&g=female
Exit Arrows
This box provides just one arrow out to the next step
MRCPSynth
available from rel. 2.5.2
This box lets you perform a Text-To-Speech conversion using the Sestek TTS.
Â
Label:Â here you can type a brief description
Text:Â the text you want to be translatedÂ
Options: they control details about the synthesis. Valid options are:
l - language to use (e.g. "en-GB", "en-US", "en-AU", etc.)
v - voice name to use (e.g. "Lindsey", "Chris", etc.)
g - voice gender to use (e.g. "male", "female")
p - profile to use, as specified in the mrcp.conf file
i - digits to allow the TTS to be interrupted with (can specify "any" to allow any digits to interrupt)
f - filename on disk to store audio to (audio not stored if not specified or empty)
epe – exit on a play error
pv - prosody volume (silent/x-soft/soft/medium/load/x-loud/default)
pr - prosody rate (x-slow/slow/medium/fast/x-fast/default)
You are not required to supply any options. Multiple options can be provided by joining options with an ampersand, e.g. l=en-US&g=female
Exit Arrows
This box provides just one arrow out to the next step