Search code examples
c#speech

How to identify speaker from voice pattern using Microsoft Speech?


I'm using Microsoft Speech C# API for Home Automation commands

I'd like to know if there is a way or built-in C# method to hash Voice Input and recognize who's speaking. If it is Alice or Bob to say "Hello Alice" or "Hello Bob".

EDIT:

Microsoft Speech API can provides a .wav of the recording. It might be able to hash, process, ... to understand who's speaking:

  • Loud voice, slow modulation, ... => Bob
  • High voice, fast modulation, ... => Alice

Solution

  • Speaker recognition is a hard problem and is still an active research area. I don't think Microsoft speech api has any speaker recognition support, but not 100% sure.

    I found the following article really helpful while researching the topic. It introduces the subject and also provides a very crude implementation. Probably a good place to start.

    http://www.ibm.com/developerworks/opensource/library/os-sndpeek/index.html