Listener Gets Another VAD

Over this summer, I wanted to make Listener Python only (there was a ton of needless applescript) and since my work @ VACCINE [VACCINE homepage] (more on this in a new post) didn’t leave much time for tinkering with things, I had to push this back as much as I could. I also added another voice activity detection algorithm which I believe works a lot better than what I had previously. And I also moved my project over to GITHUB.

The additions to Listener:

  • A new Voice Activity Detection algorithm [ link to pdf by Moattar and Homayounpour].
  • Skype4Py is still buggy on OS X with Python2.5 and it takes a nice solid dump (segfault) with Python2.6 so I am sure if the Skype part of the code works.
  • Python only. I removed the applescript portion from my code.

I have moved the entire code over to github: http://github.com/shriphani/Listener/ . The VAD algorithms can be seen in the file VAD.py.

This VAD also works by starting off with a base threshold for energy, power and an attribute called Spectral Flatness Measure. Luckily the paper had pseudocode so my DSP n00bishness wouldn’t get in the way of progress.

Anyway, to get this version of Listener running, download Listener.tar , untar it and run:

python2.5 audio_analysis.py

And you should be all set.


About this entry