Patent
1998-12-10
1999-11-09
Wieland, Susan
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
704/256, 704/261, G10L 3/00
active
059831782
ABSTRACT:
Disclosed are a speaker clustering apparatus that generates hidden Markov models (HMMs) for clusters based on feature quantities of the vocal-tract configuration estimated from speech waveform data, and a speech recognition apparatus provided with the speaker clustering apparatus. In response to the speech waveform data of N speakers, an estimator estimates feature quantities of the speakers' vocal-tract configurations with reference to a correspondence between vocal-tract configuration parameters and formant frequencies, which is determined in advance from a vocal-tract model of a standard speaker. A clustering processor then calculates speaker-to-speaker distances between the N speakers from the estimated feature quantities, and clusters the vocal-tract configurations of the N speakers using a clustering algorithm based on the calculated distances, thereby generating K clusters. Finally, the clustering processor trains an initial HMM on the speech waveform data of the speakers belonging to each of the K clusters, thereby generating K HMMs corresponding to the K clusters.
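The distance-and-clustering step in the abstract can be illustrated with a minimal Python sketch. The abstract does not name the distance measure or the clustering algorithm, so the Euclidean distance and the average-linkage agglomerative merging used here are assumptions chosen for illustration, not the patented method. The function names, speaker ids, and feature values (imagined as two vocal-tract section lengths per speaker) are likewise hypothetical. HMM training is left out; each resulting group only identifies whose speech data would be used to train one cluster model.

```python
import math
from itertools import combinations


def speaker_distance(a, b):
    """Euclidean distance between two speakers' feature vectors (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def cluster_speakers(features, k):
    """Group speakers into k clusters by average-linkage agglomeration.

    features: dict mapping speaker id -> tuple of feature quantities.
    Returns a list of k sorted lists of speaker ids.
    """
    if not 1 <= k <= len(features):
        raise ValueError("k must be between 1 and the number of speakers")

    # Pairwise speaker-to-speaker distances, computed once.
    dist = {
        frozenset((s, t)): speaker_distance(features[s], features[t])
        for s, t in combinations(features, 2)
    }

    def linkage(c1, c2):
        # Average distance over all cross-cluster speaker pairs.
        total = sum(dist[frozenset((s, t))] for s in c1 for t in c2)
        return total / (len(c1) * len(c2))

    # Start with one cluster per speaker, then repeatedly merge the
    # closest pair of clusters until only k clusters remain.
    clusters = [[s] for s in features]
    while len(clusters) > k:
        i, j = min(
            combinations(range(len(clusters)), 2),
            key=lambda p: linkage(clusters[p[0]], clusters[p[1]]),
        )
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]  # j > i, so index i is unaffected
    return [sorted(c) for c in clusters]


# Hypothetical feature quantities for N = 5 speakers (illustrative values only).
speakers = {
    "spk1": (9.1, 8.0),
    "spk2": (9.0, 8.2),
    "spk3": (7.4, 6.9),
    "spk4": (7.5, 7.0),
    "spk5": (8.3, 7.6),
}
clusters = cluster_speakers(speakers, k=2)
for index, members in enumerate(clusters):
    # Each group's speech data would be used to train one cluster HMM.
    print(index, members)
```

With K = 2, the sketch separates the two speakers with the shortest hypothetical vocal-tract lengths from the other three. A real system would replace the toy feature tuples with the estimator's output and pass each group's waveform data to an HMM trainer.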
REFERENCES:
G. Fant, "Non-Uniform Vowel Normalization", Speech Transmission Laboratory Quarterly Progress and Status Report, vol. 2-3, pp. 1-19, 1975.
Tetsuo Kosaka et al., "Speaker-Independent Speech Recognition Based on Tree-Structure Speaker Clustering", Computer Speech and Language, No. 10, pp. 55-74, 1996.
Deng Li
Naito Masaki
Sagisaka Yoshinori
ATR Interpreting Telecommunications Research Laboratories
Wieland Susan