Data processing: speech signal processing – linguistics – language – Speech signal processing – Synthesis
Patent
1997-10-28
2000-11-21
Zele, Krista
Data processing: speech signal processing, linguistics, language
Speech signal processing
Synthesis
704231, 704236, 704239, 704240, 704247, 704254, 704255, G10L 1300
Patent
active
061515752
ABSTRACT:
A source-adapted model for use in speech recognition is generated by defining a linear relationship between a first element of an initial model and a first element of the source-adapted model. Thereafter, speech data that corresponds to the first element of the initial model is assembled from a set of speech data for a particular source associated with the source-adapted model. A linear transform that maps between the assembled speech data and the first element of the initial model is then determined. Finally, a first element of the source-adapted model is produced from the first element of the initial model using the linear transform.
REFERENCES:
patent: 4759068 (1988-07-01), Bahl et al.
patent: 4805218 (1989-02-01), Bamberg et al.
patent: 4805219 (1989-02-01), Baker et al.
patent: 4817156 (1989-03-01), Bahl et al.
patent: 4817158 (1989-03-01), Picheny
patent: 4817161 (1989-03-01), Kaneko
patent: 4819271 (1989-04-01), Bahl et al.
patent: 4827521 (1989-05-01), Bahl et al.
patent: 4829576 (1989-05-01), Porter
patent: 4829577 (1989-05-01), Kuroda et al.
patent: 4831550 (1989-05-01), Katz
patent: 4833712 (1989-05-01), Bahl et al.
patent: 4837831 (1989-06-01), Gillick et al.
patent: 4876720 (1989-10-01), Kaneko et al.
patent: 4882759 (1989-11-01), Bahl et al.
patent: 4903305 (1990-02-01), Gillick et al.
patent: 4914703 (1990-04-01), Gillick
patent: 4926488 (1990-05-01), Nadas et al.
patent: 4931950 (1990-06-01), Isle et al.
patent: 4972485 (1990-11-01), Dautrich et al.
patent: 4980918 (1990-12-01), Bahl et al.
patent: 5027406 (1991-06-01), Roberts et al.
patent: 5031217 (1991-07-01), Nishimura
patent: 5033087 (1991-07-01), Bahl et al.
patent: 5036538 (1991-07-01), Oken et al.
patent: 5046099 (1991-09-01), Nishimura
patent: 5050215 (1991-09-01), Nishimura
patent: 5054074 (1991-10-01), Bakis
patent: 5054085 (1991-10-01), Meisel et al.
patent: 5072452 (1991-12-01), Brown et al.
patent: 5127055 (1992-06-01), Larkey
patent: 5129001 (1992-07-01), Bahl et al.
patent: 5170432 (1992-12-01), Hackbarth et al.
patent: 5182773 (1993-01-01), Bahl et al.
patent: 5202952 (1993-04-01), Gillick et al.
patent: 5276766 (1994-01-01), Bahl et al.
patent: 5278942 (1994-01-01), Bahl et al.
patent: 5280562 (1994-01-01), Bahl et al.
patent: 5280563 (1994-01-01), Ganong
patent: 5293451 (1994-03-01), Brown et al.
patent: 5428707 (1995-06-01), Gould et al.
patent: 5440663 (1995-08-01), Moese et al.
patent: 5467425 (1995-11-01), Lau et al.
patent: 5497447 (1996-03-01), Bahl et al.
patent: 5623578 (1997-04-01), Mikkilineni
patent: 5710864 (1998-01-01), Juang et al.
patent: 5715367 (1998-02-01), Gillick et al.
patent: 5793891 (1998-08-01), Takahashi et al.
patent: 5864810 (1999-01-01), Digalakis et al.
Asadi, Ayman, "Automatic Modeling for Adding New Words to a Large Vocabulary", ICASSP 91, vol. 1 (1991), pp. 305-308.
Bahl, Lalit, "A Maximum Likelihood Approach to Continuous Speech Recognition," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. PAMI-5, No. 2 (Mar. 1983), pp. 179-190.
Bahl, L.R., "Adaptation of Large Vocabulary Recognition System," ICASSP-92, vol. 1 (Mar. 1992), pp. 1477-1480.
Bahl, L.R., "Automatic Selection of Speech Prototypes," IBM Technical Disclosure Bulletin, vol. 24, No. 4 (Sep. 1981), pp. 2042-2043.
Bahl, L.R., "Automatic High-Resolution Labeling of Speech Waveforms," IBM Technical Disclosure Bulletin, vol. 23, No. 7B (Dec. 1980), pp. 3466-3467.
Bahl, L.R. et al., "Constructing Groups of Acoustically Confusable Words," IEEE (1990), pp. 85-88.
Bamberg, Paul G. et al., "Adaptation Performance in a Large-Vocabulary Recognizer," Dragon Systems, Inc., Newton, MA, pp. 1-7.
Fissore, Luciano et al., "Lexical Access to Large Vocabularies For Speech Recognition," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 37, No. 8 (Aug. 1989), pp. 1197-1213.
Gillick, Larry et al., "Rapid Match Training For Large Vocabularies," Dragon Systems, Inc., Newton, MA.
Haeb-Unbach, R., "Automatic Transcription of Unknown Words in a Speech Recognition System," The 1995 International Conference on Acoustics, Speech, and Signal Processing, vol. 1 (May 1995), pp. 840-843.
Imai, Toru. "A New Method for Automatic Generation of Speaker-Dependent Phonological Rules," The 1995 International Conference on Acoustics, Speech, and Signal Processing, vol. 1 (May 1995), pp. 864-867.
Mandel, Mark A. et al., "A Commercial Large-Vocabulary Discrete Speech Recognition System: DragonDictate," Language and Speech, vol. 35 (1, 2) (1992), pp. 237-246.
Gillick Laurence S.
Nagesha Venkatesh
Newman Michael Jack
Dragon Systems, Inc.
Opsasnick Michael N.
Zele Krista
LandOfFree
Rapid adaptation of speech models does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Rapid adaptation of speech models, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Rapid adaptation of speech models will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-1266504