Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Patent
1996-06-28
1999-10-05
Dorvil, Richemond
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
704240, G10L 504
Patent
active
059639032
ABSTRACT:
A method and system for dynamically selecting words for training a speech recognition system. The speech recognition system models each phoneme using a hidden Markov model and represents each word as a sequence of phonemes. The training system ranks each phoneme for each frame according to the probability that the corresponding codeword will be spoken as part of the phoneme. The training system collects spoken utterances for which the corresponding word is known. The training system then aligns the codewords of each utterance with the phoneme that it is recognized to be part of. The training system then calculates an average rank for each phoneme using the aligned codewords for the aligned frames. Finally, the training system selects words for training that contain phonemes with a low rank.
REFERENCES:
patent: 4783803 (1988-11-01), Baker et al.
patent: 4866778 (1989-09-01), Baker
patent: 5027406 (1991-06-01), Roberts et al.
patent: 5050215 (1991-09-01), Nishimura
patent: 5182773 (1993-01-01), Bahl et al.
patent: 5278942 (1994-01-01), Bahl et al.
patent: 5293584 (1994-03-01), Brown et al.
patent: 5390278 (1995-02-01), Gupta et al.
patent: 5455889 (1995-10-01), Bahl et al.
patent: 5465318 (1995-11-01), Sejnoha
patent: 5682464 (1997-10-01), Sejnoha
patent: 5682501 (1997-10-01), Sharman
patent: 5715367 (1998-02-01), Gillick et al.
Benet, Bernard, "Dictation Systems for Windows: Dragon, IBM, Kurzweil; Dragon Systems' DragonDictate, IBM's VoiceType Dictation, Kurzweil AI's Kurzweil Voice 1.2; Software Review; Evaluation," The Seybold Report on Desktop Publishing, Seybold Publications, Inc., Jun. 10, 1995, vol. 9, No. 10, p. 12.
"IBM Voice Recognition Explored at Seminar," The Legal Intelligencer, Legal Communications, Ltd., Oct. 5, 1995, p. 2.
Kurzweil Applied Intelligence, Inc., 1997,
Rabiner, Lawrence R., "A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition," IEEE Log No. 8825949, 1989, pp. 267-295.
Hon Hsiao-Wuen
Huang Xuedong D.
Hwang Mei-Yuh
Jiang Li
Ju Yun-Cheng
Dorvil Richemond
Microsoft Corporation
LandOfFree
Method and system for dynamically adjusted training for speech r does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method and system for dynamically adjusted training for speech r, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and system for dynamically adjusted training for speech r will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-1183265