Search code examples
javaspeech-to-textsphinx4

Hindi speech to text using sphinx4 in java


I am using Sphinx4 to convert speech to text, it works fine for English, but how can I use it for Hindi speech. For example i said "Mera name amit hai". After converting to text the output should be "मेरा नाम अमित है"


Solution

  • You can download Hindi acoustic model to use it with sphinx4 here:

    https://sourceforge.net/projects/hindiasr/files/Hindiasr/HindiASR-2.0/

    However, it's in a very basic state and it's unlikely it will have high accuracy. For better accuracy you might want to try an acoustic model yourself, see the tutorial for details

    http://cmusphinx.sourceforge.net/wiki/tutorial

    The setup of the model in Eclipse is simple:

    1) download the latest code as described in the tutorial:

    http://cmusphinx.sourceforge.net/wiki/tutorialsphinx4

    2) Open transcriber demo and edit path to the acoustic model and the dictionary to point to models downloaded from hindiasr

    3) Create a grammar or a language model as described in lm tutorial

    http://cmusphinx.sourceforge.net/wiki/tutoriallm

    4) Replace the file for recognition in TranscriberDemo.java.

    5) Run and see it recognizes the file