Hi,
I have a bunch of MP3 files that are conference call recordings. They usually start with some background music, followed by human voices, and then music or silence again.
I know that Speex supports voice activity detection (VAD) to encode more efficiently.
Is there any way I can use this to mark the beginning and end of voice activity in the file?
If it cannot work on MP3, would it work on raw audio input or any other format?
Regards,
Madhu