Amazon Polly – Text-to-Speech

Turn text into lifelike speech using deep learning

Amazon Polly supports dozens of languages and a wide selection of natural-sounding male and female voices. When you send text to Amazon Polly's API, it returns the audio to your application as a stream so you can play the voices immediately. Amazon Polly also supports lexicons and SSML tags which enable you to control aspects of speech, such as pronunciation, volume, pitch, and speed rate,

Setup

  1. Login or Register to Admin Portal

  2. Select Functions

  3. Select your desired App and Keyset

  4. Select "Explore Templates" and proceed with selected integration.

Walkthrough:

Begin by adding the AWS keys to the vault storage by using the MY SECRETS feature from within the blocks editor. The block will need the AWS_access_key and AWS_secret_key values.

This PubNub Functions listens for all chats and allows to get audio from text in private chats.

To trigger text-to-speech conversion, event payload should contain 'polly' field in it with following keys:

  • location - name of key in event payload, which contain data for polly

  • voice - identifier of voice which should be used to read text. By default set to: 'Joanna'

  • format - generated audio file format. By default set to: 'mp3'

Input:

1

Output:

1

Talk to an expert