I'm trying out the Microsoft cognitive services, Speaker Recognition API in ASP.NET Core. However I've no idea how to give the audio file as the input of the following supported format:
Container- WAV
Encoding- PCM
Rate- 16K
Sample Format- 16 bit
Channels- Mono
Can someone please help me out with this?
Thank you.
To configure the input mode to be wav:
// Configure the input to the recognizer.
recognizer.SetInputToWaveFile(@"c:\temp\SampleWAVInput.wav");
This article discusses how to recognize audio files with the recognizer in the System.Speech namespace.
you can also try with SetInputToAudioStream() with the format of the input.