Can any one Direct me on creating a Speaker Recognition System in Java?

What API is to use to detect the speaker's voice?
What is the fundamental approach to follow in order to identify a persons voice irrespective of the programming language and How that approach can be implemented in Java?

In general, you are getting into a huge area of artificial intelligence research applicable to many kinds of pattern recognition, not just acoustic. For simplicity, lets divide the problem.

1. Capture of wave forms of speech samples - the raw data of course is huge so it has to be treated to derive some sort characteristics which can be mathematically manipulated. The FFT - fast fourier transform is popular - you get time series of signal strength in various frequency bins. Of course you can do this in Java.

2. Trying to come up with some mechanism to recognize the characteristics of speakers from the reduced data sets. Personally I have always been attracted to genetic algorithms (thats just one example, a HUGE number of Java experiments with genetic algorithms and other classification schemes can be found.)

Like I said, a huge area - good hunting

Bill

Varuna Seneviratna
Ranch Hand

Joined: Jan 15, 2007
Posts: 167

posted Jun 02, 2011 04:11:25

0

William Brogden wrote:In general, you are getting into a huge area of artificial intelligence research applicable to many kinds of pattern recognition, not just acoustic. For simplicity, lets divide the problem.

1. Capture of wave forms of speech samples - the raw data of course is huge so it has to be treated to derive some sort characteristics which can be mathematically manipulated. The FFT - fast fourier transform is popular - you get time series of signal strength in various frequency bins. Of course you can do this in Java.

It took Dragon Systems over a decade of research and tens of millions of dollars to create Dragon Naturally Speaking. There were over a dozen scientists in the research group, and eight of us in the product group.

I don't think this is a problem you are going to solve on your own from scratch.

Bear Bibeault wrote:It took Dragon Systems over a decade of research and tens of millions of dollars to create Dragon Naturally Speaking. There were over a dozen scientists in the research group, and eight of us in the product group.

I don't think this is a problem you are going to solve on your own from scratch.

The system I have in mind is not as complicated as Dragon Naturally Speaking, what it does is it detects a speaker's identity using the speaker's voice and a word phrase the speaker has previously input to the System Database