Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Patent
1997-01-28
1999-04-20
Hudspeth, David R.
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
704246, 704251, G01L 906
Patent
active
058954473
ABSTRACT:
Clusters of quantized feature vectors are processed against each other using a threshold distance value to cluster mean values of sets of parameters contained in speaker specific codebooks to form classes of speakers against which feature vectors computed from an arbitrary input speech signal can be compared to identify a speaker class. The number of codebooks considered in the comparison may be thus reduced to limit mixture elements which engender ambiguity and reduce system response speed when the speaker population becomes large. A speaker class processing model which is speaker independent within the class may be trained on one or more members of the class and selected for implementation in a speech recognition processor in accordance with the speaker class recognized to further improve speech recognition to level comparable to that of a speaker dependent model. Formation of speaker classes can be supervised by identification of groups of speakers to be included in the class and the speaker class dependent model trained on members of a respective group.
REFERENCES:
patent: Re31188 (1983-03-01), Pirz et al.
patent: 4181821 (1980-01-01), Pirz et al.
patent: 4363102 (1982-12-01), Holmgren et al.
patent: 5165095 (1992-11-01), Borcherding
patent: 5608840 (1997-03-01), Tsuboka
patent: 5608841 (1997-03-01), Tsuboka
patent: 5638489 (1997-06-01), Tsuboka
Tetsuo Kosaka and Shigeki Sagayama, "Tree-Structured Speaker Clustering for Fast Speaker Adaptation," Proc. ICASSP 94, vol. I, pp. 245-248, May 1994.
Ananth Sankar, Francoise Beaufays, and Vassilios Digalakis, "Training Data Clustering for Improved Speech Recognition," Proc. Eurospeech 95, pp. 503-506, Sep. 1995.
Mukund Padmanabhan, Lalit R. Bahl, David Nahamoo, and Michael A. Picheny, "Speaker Clustering and Transformation for Speaker Adaptation in Large-Vocabulary Speech Recognition Systems," Proc. ICASSP 96, vol. II, pp. 701-704, May 1996.
Lawrence R. Rabiner and Ronald W. Schafer, Digital Processing of Speech Signals, Prentice-Hall, pp. 485-489, 1978.
Ittycheriah Abraham Poovakunnel
Maes Stephane Herman
Hudspeth David R.
International Business Machines - Corporation
Smits Talivaldis Ivars
Tassinari, Jr. Robert P.
LandOfFree
Speech recognition using thresholded speaker class model selecti does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Speech recognition using thresholded speaker class model selecti, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Speech recognition using thresholded speaker class model selecti will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2245234