Search code examples
speech-recognitionspeech-to-textazure-cognitive-servicesmicrosoft-speech-api

Azure Cognitive Services Speech to Text large/long audio files sample


Like to transcribe a couple of long (Dutch) audio files. They are interviews which are about 60-120 minutes per file in length. Got only 8 files which I need to do manually, so not necessarily part of some automated software. Got some Azure credits, so thought to go with Azure Cognitive Services Speech to Text. Is there a sample somewhere for that?

Tried this sample: https://learn.microsoft.com/en-us/azure/cognitive-services/speech-service/speech-to-text-sample. Works fine. But stops right away after a small pause in the audio.

Saw a similar question here: Speech-to-text large audio files [Microsoft Speech API]. But the poster didn't share back how he solved it.

Can somebody help out?


Solution

  • For longer audio files, we recommend the batch transcription APIs. A good explanation is here: https://learn.microsoft.com/en-us/azure/cognitive-services/speech-service/batch-transcription and there are samples for C# and Python here: https://github.com/Azure-Samples/cognitive-services-speech-sdk/tree/master/samples/batch.