Search code examples
audiosignal-processingspeech

Speech/ Music classification


I want to determine which part of audio file contain speech or music.

I hope someone has a made something like this or can tell me where to start. Can you please suggest some method/tutorial for doing the same.

Thank you.


Solution

  • There's lots of prior art in this area, but I'd suggest browsing through some of Dan Ellis's papers. The slides for this talk has some good background. In short it's all down to picking the right feature vectors.