
How to play AudioStream response in AWS Polly using JavaScript SDK?

This is my script:

<script src=""></script>
<script>
    AWS.config.region = 'eu-west-1';
    AWS.config.accessKeyId = 'FOO';
    AWS.config.secretAccessKey = 'BAR';

    var polly = new AWS.Polly({apiVersion: '2016-06-10'});

    var params = {
        OutputFormat: 'mp3', /* required */
        Text: 'Hello world', /* required */
        VoiceId: 'Joanna', /* required */
        SampleRate: '22050',
        TextType: 'text'
    };

    polly.synthesizeSpeech(params, function(err, data) {
        if (err) console.log(err, err.stack); // an error occurred
        else     console.log(data);           // successful response
    });
</script>

The request succeeds, and I get this kind of response:

[screenshot of the logged response, not reproduced here]

How do I use this kind of response? I understand that the response is deserialized audio, but how do I actually play it, say, inside an HTML5 audio element?

Furthermore, this answer on SO explains why this type of array is suitable for audio data:


    // audioStream is the AudioStream field of the synthesizeSpeech response
    var uInt8Array = new Uint8Array(audioStream);
    var arrayBuffer = uInt8Array.buffer;
    var blob = new Blob([arrayBuffer]);
    var url = URL.createObjectURL(blob);
    audioElement.src = url;
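
To make the conversion above easier to reuse, here is a minimal sketch that wraps it in a function and tags the Blob with a MIME type. The function name `audioStreamToUrl`, the `audio/mpeg` default (chosen to match `OutputFormat: 'mp3'`), and the `player` element id in the commented wiring are my assumptions, not part of the SDK.

```javascript
// Convert the AudioStream bytes returned by Polly into an object URL.
// Assumption: audioStream is a Uint8Array (or Node Buffer) of encoded audio.
function audioStreamToUrl(audioStream, mimeType) {
    // Copy the bytes into a fresh Uint8Array so the Blob owns its own data.
    var bytes = new Uint8Array(audioStream);
    var blob = new Blob([bytes], { type: mimeType || 'audio/mpeg' });
    return URL.createObjectURL(blob);
}

// Hypothetical wiring inside the synthesizeSpeech callback from the question:
// polly.synthesizeSpeech(params, function (err, data) {
//     if (err) { console.log(err, err.stack); return; }
//     var audioElement = document.getElementById('player'); // assumed <audio id="player">
//     audioElement.src = audioStreamToUrl(data.AudioStream);
//     audioElement.play();
// });
```

Once playback has finished, the URL can be released with `URL.revokeObjectURL(url)` so the browser can free the underlying Blob.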

    I created a JavaScript library called ChattyKathy that handles the entire process for you, if you want to take the easy way out.

    Just pass it an AWS Credentials object and then tell her what to say. She'll call AWS, transform the response, and play the audio.

    var settings = {
        awsCredentials: awsCredentials,
        awsRegion: "us-west-2",
        pollyVoiceId: "Justin",
        cacheSpeech: true
    };
    var kathy = ChattyKathy(settings);
    kathy.Speak("Hello world, my name is Kathy!");
    kathy.Speak("I can be used for an amazing user experience!");
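
If you would rather not pull in a library, the three steps described above (call AWS, transform the response, play the audio) can be sketched by hand. This is not ChattyKathy's actual code. The `speak` helper and its `play` callback are hypothetical names of mine, and the voice is hard-coded purely for illustration.

```javascript
// Hypothetical helper, not part of ChattyKathy or the AWS SDK.
// polly: object exposing synthesizeSpeech(params, callback), e.g. an AWS.Polly instance.
// play:  function that receives an object URL and starts playback.
function speak(polly, text, play) {
    var params = { OutputFormat: 'mp3', Text: text, VoiceId: 'Justin' };
    return new Promise(function (resolve, reject) {
        // Step 1: call AWS.
        polly.synthesizeSpeech(params, function (err, data) {
            if (err) { reject(err); return; }
            // Step 2: transform the raw bytes into a playable object URL.
            var blob = new Blob([new Uint8Array(data.AudioStream)], { type: 'audio/mpeg' });
            var url = URL.createObjectURL(blob);
            // Step 3: hand the URL to the caller-supplied player.
            play(url);
            resolve(url);
        });
    });
}

// Browser usage (assumes the AWS SDK is loaded and configured):
// speak(new AWS.Polly({apiVersion: '2016-06-10'}), 'Hello world', function (url) {
//     new Audio(url).play();
// });
```

Passing the Polly client and the player in as arguments keeps the helper free of globals, so it can be exercised with a stub client before real credentials are involved.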