Search code examples
amazon-web-servicesamazon-s3amazon-connect

Issue with Amazon Transcibe


I have a few recording files (audio files from Amazon connect .wav format) is stored in the S3 bucket.
I followed this link (https://aws.amazon.com/getting-started/tutorials/create-audio-transcript-transcribe/) to convert the audio files to Transcript using Amazon Transcribe.

It is successfully converted to Transcript.

NOTE: the voice in the audio files are very clear but the only thing is it was there in the decent Indian accent (Not in UK / US accent)

Surprisingly, It is not able to detect many words correctly. Most of the words were wrong

1) Is there a setting, I need to configure for detecting the Indian accent?
2) Is there anyone tested with US / UK accent and detected at least 80 percent correct?
3) Can anyone suggest What is the other tool anyone prefers for converting the audio to text?

Thanks,
Harry


Solution

  • You can try to transcribe with Indian English with code

    Indian English (en-IN)

    Hope it does not start to misbehave for the parts that are in different accents like UK/US. But still worth a try. Otherwise you may need to think about splitting the audio in different parts, which obvioulsy will be cumbersome.