Search code examples
speech-recognitioncmusphinxsphinx4

CMU Sphinx Acoustic model training hangs


I am training an acoustic model with CMU sphinx. I have around 200 hours of speech data. When I run the training script (sphinxtrain run, initially 4 of my CPU cores was at 100% and my machine got hot so it was doing some calculations. However, now the script looks looks like it is hang at Module 20, Phase 3. Upon inspection, I've realized that 4 copies of "Perl 5.12" are running in my machine with 0% CPU utilization and updating a file called qmanager/bw.2.4.out in the training directory (Baum Welch model?). This file is constantly updated; I have an SSD drive.

My question is whether this is normal (that CPU usage is at 0%) and is there a way to speed up the training.


Solution

  • My question is whether this is normal (that CPU usage is at 0%) and is there a way to speed up the training.

    No there was an error. You can check details in logs in logdir folder. Most likely you incorrectly specified a path to some data file or to the library. Sometimes it's ok to restart.

    Make sure an4 tutorial works for you first.

    there a way to speed up the training

    It must be pretty fast. If you enable training on 8 cores a model on 200 hours should train in 1 day.