I need to determine when someone speaks in an audio stream. I applied a Hamming window and calculated the FFT. How do I detect the human voice from here?
You don't need an FFT for this. What you need is a Voice Activity Detection (VAD) algorithm.
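To give a rough idea of what the simplest kind of VAD looks like, here is a minimal sketch that works purely in the time domain: it splits the signal into short frames, measures each frame's energy, and marks a frame as active when the energy exceeds a threshold. The function names, the 20 ms frame length, the -35 dB threshold and the 5-frame hangover are all assumptions chosen for illustration, and samples are assumed to be floats in [-1, 1]. Note that an energy detector only tells you that *something* is making sound, not that it is a human voice; practical VADs add further features and a trained classifier on top.

```python
import math

def frame_energy_db(frame):
    """Mean power of one frame in dB relative to full scale (1.0)."""
    power = sum(s * s for s in frame) / len(frame)
    return 10.0 * math.log10(power + 1e-12)  # small offset avoids log10(0)

def detect_voice(samples, sample_rate, frame_ms=20,
                 threshold_db=-35.0, hangover_frames=5):
    """Return one boolean per frame: True where the frame is judged active.

    All parameter defaults are illustrative assumptions, not tuned values.
    """
    frame_len = int(sample_rate * frame_ms / 1000)
    flags = []
    hang = 0  # frames left to keep reporting "active" after energy drops
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        if frame_energy_db(frame) > threshold_db:
            hang = hangover_frames
            flags.append(True)
        elif hang > 0:
            hang -= 1
            flags.append(True)
        else:
            flags.append(False)
    return flags

# Synthetic test signal: 0.5 s silence, 0.5 s of a 200 Hz tone, 0.5 s silence.
RATE = 8000
silence = [0.0] * (RATE // 2)
tone = [0.5 * math.sin(2 * math.pi * 200 * n / RATE) for n in range(RATE // 2)]
flags = detect_voice(silence + tone + silence, RATE)
```

In this example `flags` holds one entry per 20 ms frame, so the start and end of activity can be read off as the indices where the value changes. The hangover keeps the detector from chopping off quiet word endings, at the cost of reporting activity for a few frames after the sound stops. On real recordings the fixed threshold would normally be replaced by one that adapts to the background noise level.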