Search code examples
azuretext-to-speechazure-cognitive-servicesspeech

Azure Text to Speech API - Limited to 10 Minutes of Audio?


Is there a limit to the amount of text which can be submitted to the TTS (neural) Speech Service endpoints?

All of the requests I'm making from an Azure Function are successful but have a cutoff at 10 minutes exactly.


Solution

  • Yes, it is stated in the old Bing Speech API documentation that the Speech Service places limitations on the duration of the WebSocket connections to the service with a maximum duration of 10 minutes for active WebSocket connection and a maximum of 180 seconds for inactive.

    UPDATE

    It is also stated in the new Speech Service documentation that an access token is valid for 10 minutes.