Method and apparatus for text-independent speaker recognition

Electrical audio signal processing systems and devices – One-way audio signal program distribution – Public address system

Patent

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

3645135, G10L 100

Patent

active

047208635

ABSTRACT:
A method and apparatus for recognizing an unknown speaker from a plurality of speaker candidates. Portions of speech from the speaker candidates and from the unknown speaker are sampled and digitized. The digitized samples are converted into frames of speech, each frame representing a point in an LPC-12 multi-dimensional speech space. Using a character covering algorithm, a set of frames of speech is selected, called characters, from the frames of speech of all speaker candidates. The speaker candidates' portions of speech are divided into smaller portions called segments. A smaller plurality of model characters for each speaker candidate is selected from the character set. For each set of model characters the distance from each speaker candidate's frame of speech to the closest character in the model set is determined and stored in a model histogram. When a model histogram is completed for a segment a distance D is found whereby at least a majority of frames have distances greater D. The mean distance value of D and variance across all segments for both speaker and imposter is then calculated. These values are added to the set of model characters to form the speaker model. To perform recognition the frames of the unknown speaker as they are received are buffered and compared with the sets of model characters to form model histograms for each speaker. A likelihood ratio is formed. The speaker candidate with the highest likelihood ratio is chosen as the unknown speaker.

REFERENCES:
patent: Re31188 (1983-03-01), Pirz et al.
patent: 4092493 (1978-05-01), Rabiner et al.
patent: 4191853 (1980-03-01), Riesinger
patent: 4292471 (1981-09-01), Kuhn et al.
patent: 4301329 (1981-11-01), Taguchi et al.
patent: 4343969 (1982-08-01), Kellett
patent: 4389540 (1983-06-01), Nakamura et al.
patent: 4426551 (1984-01-01), Komatsu et al.
patent: 4472832 (1984-09-01), Atal et al.
patent: 4488243 (1984-12-01), Brown et al.
IEEE Trans. on Audio and Electroacoustics, vol. AU-21, No. 3, 6/73, pp. 140-141, Makhoul.
Abstract for Proceedings of the 1981 IEEE International Conference on Acoustics, Speech and Signal entitled,"Speaker Independent and Verification Combined with Speaker Independent Word Recognition, by A. E. Rosenberg and K. L. Shipley.
Abstract for an IEEE Transactions on Acoustics Speech and Signal Processing entitled,"On Creating Reference Templates for Speaker Independent Recognition of Isolated Words" by L. R. Rabiner.
Independent Recognition of Isolated Words", by L. R. Rabiner.
Atal, B. S. (1974), "Effectiveness of Linear Prediction Characteristics of the Speech Waves for Automatic Speaker Identification and Verification," J. Acoust. Soc. Amer., vol. 55, pp. 1304-1312, 1974.
Atal, B. S. (1976), "Automatic Recognition of Speakers from their Voices," Proceedings of the IEEE, vol. 64, pp. 460-475, Apr. 1976.
Markel, J. D. and Davis, S. B. (1979), "Text Independent Speaker Recognition from a Large Linguistically Unconstrainted Time-Spaced Data Base," IEEE Trans. on Acoustics, Speech, and Signal Processing, vol. ASSP-27, pp. 74-82, Feb. 1979.
Markel, J. D., Oshika, B. T., and Gray, A. H., Jr. (1977) "Long-Term Feature Averaging for Speaker Recognition," IEEE Trans. on Acoustics, Speech, and Signal Processing, vol. ASSP-25, pp. 330-337, Aug. 1977.
Wohlford, R. E., Wrench, E. H., and Landell, B. P. (1980), "A Comparison of Four Techniques for Automatic Speaker Recognition," Proc. ICASSP-80, vol. 3, pp. 908-911, 1980.
Wrench, E. H. (1981), "A Realtime Implementation of a Text Independent Speaker Recognition System," Proc. ICASSP-81, vol. 1, pp. 193-196 (1981).
Li, K. P., and Hughes, G. W. (1974), "Talker Differences as they Appear in Correlation Matrices of Continuous Speech Spectra," J. Acoust. Soc. Amer., vol. 55, pp. 833-837, Apr. 1974.
Wakita, H. (1976), "Residual Energy of Linear Prediction Applied to Vowel and Speaker Recognition," IEEE Trans. on Acoustics, Speech, and Signal Processing, vol. ASSP-24, pp. 270-271, 1976.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Method and apparatus for text-independent speaker recognition does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Method and apparatus for text-independent speaker recognition, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and apparatus for text-independent speaker recognition will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-374759

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.