Amazon Polly – Text-to-Speech

Speech

Author

PubNub

Last Updated

2020-05-14

Links

Turn text into lifelike speech using deep learning

Amazon Polly supports dozens of languages and a wide selection of natural-sounding male and female voices. When you send text to Amazon Polly's API, it returns the audio to your application as a stream so you can play the voices immediately. Amazon Polly also supports lexicons and SSML tags which enable you to control aspects of speech, such as pronunciation, volume, pitch, and speed rate,

Setup

Login or Register to Admin Portal
Select Functions
Select your desired App and Keyset
Select "Explore Templates" and proceed with selected integration.
Create an account with AWS
Grab the AWS_access_key and AWS_secret_key - you will need these details to complete configuration of this integration
Select this template by using ‘Use Template’ button
Assign the module and keyset under which the integration should run
Configure AWS_secret_key and AWS_access_key
Confirm the configuration
Review the code and if it looks ok - start the function

Walkthrough

This PubNub Function listens for all chats and allows to get audio from text in private chats. To trigger text-to-speech conversion, event payload should contain 'polly' field in it with following keys: location - name of key in event payload, which contain data for polly voice - identifier of voice which should be used to read text. By default set to: 'Joanna' format - generated audio file format. By default set to: 'mp3'

location - name of key in event payload, which contain data for polly
voice - identifier of voice which should be used to read text. By default set to: 'Joanna'
format - generated audio file format. By default set to: 'mp3'

Amazon Polly – Text-to-Speech

Turn text into lifelike speech using deep learning

Setup

Walkthrough

Input:

Output:

Talk to an expert