Patent
1997-03-21
1998-04-28
MacDonald, Allen R.
Data processing: speech signal processing, linguistics, language
Speech signal processing
For storage or transmission
704/200, 704/243, 704/247, 704/256, 704/231, G10L 5/00
active
057458731
ABSTRACT:
A method for recognizing speech elements (e.g., phones) in utterances includes the following steps. Based on acoustic frequency, at least two different acoustic representatives are isolated for each of the utterances. From each acoustic representative, tentative decision information on the speech element in the corresponding utterance is derived. A final decision on the speech element in the utterance is then generated, based on the tentative decision information derived from more than one of the acoustic representatives.
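The flow described in the abstract (several frequency-based representatives, a tentative decision from each, then one final decision) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the three bands, the raw scores, the function names, and in particular the combination rule (summing log-probabilities across bands), which the abstract does not specify.

```python
import math

def tentative_decision(band_scores):
    """Normalize one band's raw, non-negative scores into a distribution over phones."""
    total = sum(band_scores.values())
    return {phone: score / total for phone, score in band_scores.items()}

def final_decision(tentative, floor=1e-6):
    """Combine per-band distributions by summing log-probabilities (an assumed rule)."""
    phones = tentative[0].keys()
    combined = {
        phone: sum(math.log(max(dist[phone], floor)) for dist in tentative)
        for phone in phones
    }
    return max(combined, key=combined.get), combined

# Hypothetical raw scores for one utterance, one dict per frequency band.
bands = [
    {"p": 6.0, "b": 3.0, "t": 1.0},  # low band
    {"p": 2.0, "b": 5.0, "t": 3.0},  # mid band
    {"p": 6.0, "b": 3.0, "t": 1.0},  # high band
]

tentative = [tentative_decision(band) for band in bands]
phone, scores = final_decision(tentative)
print(phone)  # prints "p"
```

In this example the mid band alone would pick "b", but the final decision is "p" because it draws on the tentative information from all three bands rather than on any single one.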
REFERENCES:
patent: 4227177 (1980-10-01), Moshier
patent: 4348553 (1982-09-01), Baker et al.
patent: 4432096 (1984-02-01), Bunge
patent: 4509186 (1985-04-01), Omura et al.
patent: 4587670 (1986-05-01), Levinson et al.
patent: 4805218 (1989-02-01), Bamberg et al.
patent: 4805219 (1989-02-01), Baker et al.
patent: 4817159 (1989-03-01), Hoshimi et al.
patent: 4905285 (1990-02-01), Allen et al.
patent: 4914703 (1990-04-01), Gillick
patent: 4937871 (1990-06-01), Hattori
patent: 4956865 (1990-09-01), Lennig et al.
patent: 4963030 (1990-10-01), Makur
patent: 5050215 (1991-09-01), Nishimura
patent: 5210820 (1993-05-01), Kenyon
patent: 5268990 (1993-12-01), Cohen et al.
patent: 5299284 (1994-03-01), Roy
Paul, The Lincoln Laboratory Journal, vol. 3, 1990, pp. 41-61, "Speech Recognition Using Markov Models".
Milner et al., ASHA Reports, pp. 30-44, 1984, No. 14, "Perception of Filtered Speech by Hearing-Impaired Listeners".
Davis, et al., IEEE Transactions on Acoustics, Speech, and Signal Processing, ASSP-28:pp. 357-366, Aug. 1980, "Comparison of Parametric Representations For Monosyllabic Word Recognition in Continuously Spoken Sentences".
Lee et al., IEEE Transactions on Acoustics, Speech, and Signal Processing, 37:pp. 1641-1648, 1989, "Speaker-Independent Phone Recognition Using Hidden Markov Models".
Gupta et al., IEEE, pp. 697-700, 1987, "Integration of Acoustic Information in a Large Vocabulary Word Recognizer".
Vilaclara, Signal Processing III: Theories and Applications, 407-410, 1986, "Recognition of Labial-Doubles for a Substitution Hearing Aid".
Schwartz et al., IEEE 1205-1208, 1985, "Context-Dependent Modeling for Acoustic-Phonetic Recognition of Continuous Speech".
Dautrich et al., IEEE Transactions on Acoustics, Speech, and Signal Processing, ASSP-31:793-806, 1983, "On The Effects of Varying Filter Bank Parameters on Isolated Word Recognition".
Lee et al., Speech Communication 9:497-508, 1990, "Speech Recognition Using Hidden Markov Models: a CMU Perspective".
Rabiner, Proceedings of the IEEE 77:257-285, Feb. 1989, "A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition".
Braida, The Experimental Psychology Society 43A (3) 647-677, 1991, "Crossmodal Integration in the Identification of Consonant Segments".
G.M. White and R.B. Neely, "Speech recognition experiments with linear prediction, bandpass filtering, and dynamic programming", IEEE Trans. Acoust. Speech Signal Processing, vol. ASSP-24, pp. 183-188, 1976.
H. Secker-Walker and C. Searle, "Time-domain analysis of auditory-nerve-fiber firing rates", J. Acoust. Soc. Amer., 88, pp. 1427-1436, 1990.
Y. Linde, A. Buzo, and R.M. Gray, "An algorithm for vector quantizer design", IEEE Transactions on Communications, COM-28(1), pp. 84-95, 1980.
L.F. Lamel, R.H. Kassel and S. Seneff, "Speech database development: Design and analysis of the acoustic-phonetic corpus", Proc. Darpa Speech Recogn. Workshop, L.S. Bauman, Ed., pp. 100-109, 1986.
L.E. Baum, "An inequality and associated maximization technique in statistical estimation of probabilistic functions of Markov processes", Inequalities, 3, pp. 1-8, 1972.
J.K. Baker, "The Dragon System--An overview", IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-23, pp. 24-29, 1975.
J.R. Bellegarda and D. Nahamoo, "Tied Mixture Continuous Parameter Modeling for Speech Recognition", IEEE Trans. Acoust., Speech, Signal Processing, 38(12), Dec. 1990, pp. 2033-2045.
R.M. Gray, "Vector Quantization", Readings in Speech Recognition, Ed. by A. Waibel & K.F. Lee, Morgan Kaufman Publishers, Inc., 1990, pp. 75-100.
J.K. Baker, "Stochastic Modeling for Automatic Speech Understanding", Readings in Speech Recognition, Ed. by A. Waibel & K.F. Lee, Morgan Kaufman Publishers, Inc., 1990, pp. 297-319.
X.D. Huang et al., "Semi-Continuous Hidden Markov Models for Speech Signals", Readings in Speech Recognition, Ed. by A. Waibel & K.F. Lee, Morgan Kaufman Publishers, Inc., 1990, pp. 340-365.
K.F. Lee et al., "An Overview of the SPHINX Speech Recognition System", Readings in Speech Recognition, Ed. by A. Waibel & K.F. Lee, Morgan Kaufman Publishers, Inc., 1990, pp. 600-610.
Bellegarda et al., "Automatic Recognition of Integrated Speech and Handwriting Information", IBM Research, T.J. Watson Research Center, Yorktown Heights, NY, Oct. 21, 1991, pp. 1-22.
Fletcher, "Methods of Measuring The Recognition Aspect of Speech", Speech and Hearing in Communication, Chapter 15, 1953, pp. 278-317.
Allen, "How Do Humans Process and Recognize Speech?", IEEE Transactions on Speech and Audio Processing, vol. 2, No. 4, Oct. 1994, 567-577.
Research Triangle Institute, Autocuer Product Description Manuscript.
Miller et al., "An Analysis of Perceptual Confusions Among Some English Consonants", The Journal of the Acoustical Society of America, 27(2):338-352 (1955).
O'Shaughnessy, "Redundancy in Speech" Speech Communication Human and Machine, Addison-Wesley Publishing Company, Copyright 1987.
Stevens et al., "Toward a Phonetic and Phonological Theory of Redundant Features", Invariance and Variability in Speech Processes, Erlbaum Assoc. (1987).
Gupta et al., "Integration of Acoustic Information in a Large Vocabulary Word Recognizer", 1987, IEEE.
Bourlard et al., "Towards Increasing Speech Recognition Error Rates", May 1996, Speech Communication, vol. 18, No. 3.
Braida Louis D.
Duchnowski Paul
Collins Alphonso A.
MacDonald Allen R.
Massachusetts Institute of Technology
Speech recognition using final decision based on tentative decis