Speech recognition using thresholded speaker class model selecti

Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition

Patent

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

704246, 704251, G01L 906

Patent

active

058954473

ABSTRACT:
Clusters of quantized feature vectors are processed against each other using a threshold distance value to cluster mean values of sets of parameters contained in speaker specific codebooks to form classes of speakers against which feature vectors computed from an arbitrary input speech signal can be compared to identify a speaker class. The number of codebooks considered in the comparison may be thus reduced to limit mixture elements which engender ambiguity and reduce system response speed when the speaker population becomes large. A speaker class processing model which is speaker independent within the class may be trained on one or more members of the class and selected for implementation in a speech recognition processor in accordance with the speaker class recognized to further improve speech recognition to level comparable to that of a speaker dependent model. Formation of speaker classes can be supervised by identification of groups of speakers to be included in the class and the speaker class dependent model trained on members of a respective group.

REFERENCES:
patent: Re31188 (1983-03-01), Pirz et al.
patent: 4181821 (1980-01-01), Pirz et al.
patent: 4363102 (1982-12-01), Holmgren et al.
patent: 5165095 (1992-11-01), Borcherding
patent: 5608840 (1997-03-01), Tsuboka
patent: 5608841 (1997-03-01), Tsuboka
patent: 5638489 (1997-06-01), Tsuboka
Tetsuo Kosaka and Shigeki Sagayama, "Tree-Structured Speaker Clustering for Fast Speaker Adaptation," Proc. ICASSP 94, vol. I, pp. 245-248, May 1994.
Ananth Sankar, Francoise Beaufays, and Vassilios Digalakis, "Training Data Clustering for Improved Speech Recognition," Proc. Eurospeech 95, pp. 503-506, Sep. 1995.
Mukund Padmanabhan, Lalit R. Bahl, David Nahamoo, and Michael A. Picheny, "Speaker Clustering and Transformation for Speaker Adaptation in Large-Vocabulary Speech Recognition Systems," Proc. ICASSP 96, vol. II, pp. 701-704, May 1996.
Lawrence R. Rabiner and Ronald W. Schafer, Digital Processing of Speech Signals, Prentice-Hall, pp. 485-489, 1978.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Speech recognition using thresholded speaker class model selecti does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Speech recognition using thresholded speaker class model selecti, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Speech recognition using thresholded speaker class model selecti will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2245234

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.