Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Patent
1996-05-06
1998-09-08
Hudspeth, David R.
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
704258, 704240, 704254, 704255, G10L 500
Patent
active
058060308
ABSTRACT:
The clustering technique produces a low complexity and yet high accuracy speech representation for use with speech recognizers. The task database comprising the test speech to be modeled is segmented into subword units such as phonemes and labeled to indicate each phoneme in its left and right context (triphones). Hidden Markov Models are constructed for each context-independent phoneme and trained. Then the center states are tied for all phonemes of the same class. Triphones are trained and all poorly-trained models are eliminated by merging their training data with the nearest well-trained model using a weighted divergence computation to ascertain distance. Before merging, the threshold for each class is adjusted until the number of good models for each phoneme class is within predetermined upper and lower limits. Finally, if desired, the number of mixture components used to represent each model may be increased and the models retrained. This latter step increases the accuracy.
REFERENCES:
patent: 4783804 (1988-11-01), Juang et al.
patent: 4803729 (1989-02-01), Baker
patent: 4829577 (1989-05-01), Kuroda et al.
patent: 5075896 (1991-12-01), Wilcox et al.
patent: 5289562 (1994-02-01), Mizuta et al.
patent: 5450523 (1995-09-01), Zhao
patent: 5502790 (1996-03-01), Yi
patent: 5598507 (1997-01-01), Kimber et al.
Chawan Vijay B.
Hudspeth David R.
LandOfFree
Low complexity, high accuracy clustering method for speech recog does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Low complexity, high accuracy clustering method for speech recog, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Low complexity, high accuracy clustering method for speech recog will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-1296018