TTS - Text to Speech

What’s about

In this section, you find the blocks which can be used for text-to-speech conversion.

GoogleCloudTTS

This box lets you perform a Text-To-Speech conversion using the Google Cloud TTS*

Explore this documentation to find out How to retrieve Google Key for Cally Square blocks

  • An Internet connection is required for this box to work

image-20240530-153802.png
  • Label: here you can type a brief description

  • Provider: possibility to configure Google as a Cloud Provider

  • Text Type: (mandatory) The available choices are Text (PlainText) or SSML (Speech Synthesis Markup Language: refer to Google official documentation to find out more about how to use it)

  • Text: the text you want to be translated 

  • Language Code: the language you want to use for the translation

  • Voice Type (choosing one of the proposed choices like News, Standard, Studio, Polyglot…)

  • Voice Name: the voice of the operator (choose between Female/Male and voice type)

  • Speaker Type (so audio device profile like smartphones, headphones, car speakers, IVR…)

  • Speed, so speaking rate

  • Pitch to adjust the intonation of the generated voice

Exit Arrows

This box provides just one arrow out to the next step

 

GoogleTTS

This box lets you perform a Text-To-Speech conversion using the Google TTS*.

  • An Internet connection is required for this box to work

Warning

We do not recommend using this block in production but only for testing. We recommend using GoogleCloudTTS instead of GoogleTTS.

 

  • Label: here you can type a brief description

  • Text: the text you want to be translated (the maximum text length allowed is 200 characters)

  • Language: the language you want to use for the translation (i.e. it, en, etc..)

Exit Arrows

This box provides just one arrow out to the next step

ISpeechTTS

This box lets you perform a Text-To-Speech conversion using the Ispeech TTS Agi* Parameters

  • Label: here you can type a brief description

  • Text: the text you want to be translated 

  • Key: your acquired license key from the ispeech.org account

  • Language: the language you want to use for the translation (i.e. it, en, etc..)

  • Speed

  • Interrupt key

Exit Arrows

This box provides just one arrow out to the next step.

AWSPolly

This box enables a Text-To-Speech conversion function using AWS Polly Agi Parameters*.

For additional information see https://aws.amazon.com/polly/

Exit Arrows

This box provides just one arrow out to the next step

LumenvoxTTS

This box lets you perform a Text-To-Speech conversion using the Lumenvox TTS

 

  • Label: here you can type a brief description

  • Text: the text you want to be translated 

  • Options: they control details about the synthesis. Valid options are:

    • l - language to use (e.g. "en-GB", "en-US", "en-AU", etc.)

    • v - voice name to use (e.g. "Lindsey", "Chris", etc.)

    • g - voice gender to use (e.g. "male", "female")

    • p - profile to use, as specified in the mrcp.conf file

    • i - digits to allow the TTS to be interrupted with (can specify "any" to allow any digits to interrupt)

    • f - filename on disk to store audio to (audio not stored if not specified or empty)

    • epe – exit on a play error

    • pv - prosody volume (silent/x-soft/soft/medium/load/x-loud/default)

    • pr - prosody rate (x-slow/slow/medium/fast/x-fast/default)

You are not required to supply any options. Multiple options can be provided by joining options with an ampersand, e.g. l=en-US&g=female

Exit Arrows

This box provides just one arrow out to the next step

SestekTTS

available from rel. 2.0.84

This box lets you perform a Text-To-Speech conversion using the Sestek TTS.

 

  • Label: here you can type a brief description

  • Text: the text you want to be translated 

  • Options: they control details about the synthesis. Valid options are:

    • l - language to use (e.g. "en-GB", "en-US", "en-AU", etc.)

    • v - voice name to use (e.g. "Lindsey", "Chris", etc.)

    • g - voice gender to use (e.g. "male", "female")

    • p - profile to use, as specified in the mrcp.conf file

    • i - digits to allow the TTS to be interrupted with (can specify "any" to allow any digits to interrupt)

    • f - filename on disk to store audio to (audio not stored if not specified or empty)

    • epe – exit on a play error

    • pv - prosody volume (silent/x-soft/soft/medium/load/x-loud/default)

    • pr - prosody rate (x-slow/slow/medium/fast/x-fast/default)

You are not required to supply any options. Multiple options can be provided by joining options with an ampersand, e.g. l=en-US&g=female

Exit Arrows

This box provides just one arrow out to the next step

MRCPSynth

available from rel. 2.5.2

This box lets you perform a Text-To-Speech conversion using the Sestek TTS.

 

  • Label: here you can type a brief description

  • Text: the text you want to be translated 

  • Options: they control details about the synthesis. Valid options are:

    • l - language to use (e.g. "en-GB", "en-US", "en-AU", etc.)

    • v - voice name to use (e.g. "Lindsey", "Chris", etc.)

    • g - voice gender to use (e.g. "male", "female")

    • p - profile to use, as specified in the mrcp.conf file

    • i - digits to allow the TTS to be interrupted with (can specify "any" to allow any digits to interrupt)

    • f - filename on disk to store audio to (audio not stored if not specified or empty)

    • epe – exit on a play error

    • pv - prosody volume (silent/x-soft/soft/medium/load/x-loud/default)

    • pr - prosody rate (x-slow/slow/medium/fast/x-fast/default)

You are not required to supply any options. Multiple options can be provided by joining options with an ampersand, e.g. l=en-US&g=female

Exit Arrows

This box provides just one arrow out to the next step

Related topics