Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Patent
1998-01-22
1999-09-14
Hudspeth, David R.
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
704242, 704240, 704252, G10L 506
Patent
active
059537019
ABSTRACT:
A method of gender dependent speech recognition includes the steps of identifying phone state models common to both genders, identifying gender specific phone state models, identifying a gender of a speaker and recognizing acoustic data from the speaker. A method of constructing a gender-dependent speech recognition model includes the steps of providing training data of a known gender, aligning the training data, tagging the training data with a gender to create gender-tagged data, determining a gender question at a node to determine gender dependence of the gender-tagged data, determining a phonetic context question at the node to determine phonetic context dependence of the gender-tagged data, determining a highest value of an evaluation function between the gender dependence and the phonetic context dependence to determine which dependence is a dominant dependence, splitting the data of the dominant dependence into child nodes according to likelihood criteria, comparing the highest value with a threshold value to determine if additional splitting is necessary, repeating theses steps for each child node until the highest value is below the threshold value and counting the nodes having gender dependence to determine an overall gender dependence level. A gender-dependent speech recognition system includes an input device for inputting speech to a preprocessor. The preprocessor converts the speech into acoustic data, and a processor for identifies gender-dependent phone state models and phone state modes common to both genders. The phone state models are stored in a memory device wherein the processor recognizes the speech in accordance with the phone state models.
REFERENCES:
patent: 5675705 (1997-10-01), Singhal
patent: 5787394 (1998-07-01), Bahl et al.
patent: 5825978 (1998-10-01), Digalakis et al.
L. R. Bahl et al., "Decision Trees for Phonological Rules in Continuous Speech", S3.9, .COPYRGT.1991 IEEE. pp. 185-188.
L. R. Bahl et al, "Robust Methods For Using Context-Dependent Features And Models In a Continuous Speech Recognizer.", .COPYRGT.1994 IEEE, pp. I-533-I-5336.
Chalapathy V. Neti et al., "Word-Based Confidence Measures As A Guide For Stack Search In Speech Recognition", .COPYRGT.1997 IEEE, pp. 883-886.
Neti Chalapathy Venkata
Roukos Salim Estephan
Hudspeth David R.
International Business Machines - Corporation
Smits Talivaldis Ivars
Tassinari, Jr. Robert P.
LandOfFree
Speech recognition models combining gender-dependent and gender- does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Speech recognition models combining gender-dependent and gender-, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Speech recognition models combining gender-dependent and gender- will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-1520498