Self-learning speaker adaptation based on spectral variation sou

Data processing: vehicles – navigation – and relative location – Navigation – Employing position determining equipment

Patent

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

704240, 704242, 704256, G10L 506, G10L 708

Patent

active

056640595

ABSTRACT:
A self-learning speaker adaptation method for automatic speech recognition is provided. The method includes building a plurality of Gaussian mixture density phone models for use in recognizing speech. The Gaussian mixture density phone models are used to recognize a first utterance of speech from a given speaker. After the first utterance of speech has been recognized, the recognized first utterance of speech is used to adapt the Gaussian mixture density hone models for use in recognizing a subsequent utterance of speech from that same speaker, whereby the Gaussian mixture density phone models are automatically adapted to that speaker in self-learning fashion to thereby produce a plurality of adapted Gaussian mixture density phone models.

REFERENCES:
patent: 4363102 (1982-12-01), Holmgren et al.
patent: 4903305 (1990-02-01), Gillick et al.
patent: 5033087 (1991-07-01), Bahl et al.
patent: 5165008 (1992-11-01), Hermansky et al.
patent: 5193142 (1993-03-01), Zhao
patent: 5450523 (1995-09-01), Zhao
K.F. Lee, "Large Vocabulary Speaker-Independent Continuous Speech Recognition: the SPHINX System", Doctoral Dissertation, Carnegie Mellon Univ, CMU-CS-88-148, Apr., 1988 pp. 19-41.
S. Furui, "Unsupervised Speaker Adaptation Method Based on Hierarchical Spectral Clustering," Proc. ICASSP pp. 286-289, Glasgow, Scotland, May, 1989.
Y. Zhao et al, "An HMM Based Speaker-Independent Continuous Speech Recognition System w/Experiments on the TIMIT DATABASE", Proc. ICASSP, pp. 333-336, Toronto, Canada, May, 1991.
Kubala, Francis et al. "Speaker Adaptation From a Speaker-Independent Training Corpus," IEEE ICASSP, pp. 137-140, Apr. 1990.
Huang, X.D. and Lee, K.F., "On Speaker-Independent, Speaker-Dependent, and Speaker-Adaptive Speech Recognition," IEEE ICASSP, pp. 877-868, May 1991.
Rozzi, William A. and Stern, Richard M., "Speaker Adaptation in Continuous Speech Recognition Via Estimation of Correlated Mean Vectors," IEEE ICASSP, pp. 865-868, May 1991.
Schmidbauer, O., Tebelskis, J., "An LVQ Based Reference Model for Speaker Adaptive Speech Recognition," IEEE ICASSP, pp. I-441-I444, Mar. 1992.
Furui, Sadaoki, "Unsupervised Speaker Adaptation Method Based on Hierarchical Spectral Clustering," ICASSP, pp. 286-289, May 1989.
Hunt, Melvyn, "Session S. Speech Communication III: Speech Recognition," J. Acoust. Soc. Am. Suppl. 1, vol. 69, Spring 1981, pp. S41-S42.
Matsumoto, Hiroshi, Wakita, Hisashi, "Vowel Normalization by Frequency Warped Spectral Matching," Elsevier Science Pub. B.V., Speech Communication, vol. 5, 1986, pp. 239-251.
Cox, S.J., Bridle, J.S., "Unsupervised Speaker Adaptation by Probabilistic Spectrum Fitting," ICASSP, pp. 294-297, May 1989.
Cos, S.J., Bridle, J.S. "Simultaneous Speaker Normalization and Utterance Labelling Using Bayesian/Neural Net Techniques," IEEE ICASSP, pp. 161-164, Apr. 1990.
Lee, Chin-Hui et al., "A Study on Speaker Adaptation of Continuous Density HMM Parameters," IEEE ICASSP, pp. 145-148, Apr. 1990.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Self-learning speaker adaptation based on spectral variation sou does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Self-learning speaker adaptation based on spectral variation sou, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Self-learning speaker adaptation based on spectral variation sou will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-314617

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.