Patent
1994-06-20
1997-03-25
MacDonald, Allen R.
395 265, 395 249, 395 251, 395 242, G10L 708, G10L 506
active
056152990
ABSTRACT:
A speech recognition technique utilizes a set of N different principal discriminant matrices. Each principal discriminant matrix is associated with a distinct class. The class is an indication of the proximity of a speech segment to neighboring phones. A technique for speech encoding includes arranging a speech signal into a series of frames. For each frame, a feature vector is derived which represents the speech signal for a speech segment or series of speech segments. A set of N different projected vectors is generated for each frame by multiplying each of the principal discriminant matrices by the feature vector. This speech encoding technique is capable of being used in speech recognition systems by utilizing models in which each model transition is tagged with one of the N classes. The projected vector is utilized with the corresponding tag to compute the probability that at least one particular speech part is present in said frame.
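The projection and tagging steps described in the abstract can be sketched roughly as follows. This is an illustrative sketch only, not the patented implementation: the function names, the toy matrices, and the dictionary layout of a transition are all hypothetical, and the diagonal-covariance Gaussian used for scoring is an assumption, since the abstract says only that a probability is computed from the projected vector and its tag.

```python
import math

def project(matrix, vector):
    """Multiply one discriminant matrix (a list of rows) by a feature vector."""
    return [sum(m * x for m, x in zip(row, vector)) for row in matrix]

def project_all(matrices, vector):
    """Return the N projected vectors for one frame, one per class."""
    return [project(matrix, vector) for matrix in matrices]

def log_gaussian(vector, mean, var):
    """Log density of a diagonal-covariance Gaussian (assumed scoring model)."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (x - m) ** 2 / v)
        for x, m, v in zip(vector, mean, var)
    )

def transition_score(projected, transition):
    """Score a frame on a transition using only the projection its tag selects."""
    y = projected[transition["tag"]]
    return log_gaussian(y, transition["mean"], transition["var"])

# Toy data (hypothetical): N = 2 classes, a 3-dimensional feature vector,
# and 2-dimensional projections.
matrices = [
    [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]],  # matrix for class 0
    [[0.0, 1.0, 0.0], [0.0, 0.0, 1.0]],  # matrix for class 1
]
frame = [0.5, -1.0, 2.0]
projected = project_all(matrices, frame)

# A model transition tagged with class 1; mean and variance are made up.
transition = {"tag": 1, "mean": [-1.0, 2.0], "var": [1.0, 1.0]}
score = transition_score(projected, transition)
```

Each frame is projected N times, but a given transition consults only the projection named by its tag, so the class-specific matrix decides which directions of the feature space matter for that transition.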
REFERENCES:
patent: 4741036 (1988-04-01), Bahl et al.
patent: 5072452 (1991-12-01), Brown et al.
"Vector Quantization Procedure For Speech Recognition Systems Using Discrete Parameter Phoneme-Based Markov Word Models" IBM Technical Disclosure Bulletin, vol. 32, No. 7, Dec. 1989 pp. 320-321.
"Phoneme Recognition Using Time-Delay Neural Networks", by Waibel, A. et al., IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 37, No. 3 1989 pp. 328-339.
"Admissible Strategies for Reducing Search Effort in Real Time Speech Recognition Systems" by L. R. Bahl, et al., Elsevier Science Publishers B.V., 1990 pp. 1371-1374.
"Application of an Auditory Model to Speech Recognition" by Jordan Cohen, J. Acoust. Soc. Am. 85 (6), Jun. 1989 pp. 2623-2629.
"An IBM Based Large-Vocabulary Isolated-Utterance Speech Recognizer" by A. Averbuch et al. 1986 IEEE pp. 53-56.
"Syllogistic Reasoning in Fuzzy Logic and its Application to Usuality And Reasoning with Dispositions", by Lotfi A. Zadeh, 1985 IEEE, pp. 754-762.
"A Method For The Construction of Acoustic Markov Models for Words" by L. R. Bahl et al., 1993 IEEE pp. 443-452 (vol. 1, No. 4).
"Speech Recognition Using Noise-Adaptive Prototypes" by Arthur Nadas, D. Nahamoo, IEEE Transactions on Acoustics, vol. 37, No. 10, Oct. 1989, pp. 1495-1503.
"Differential Competitive Learning For Centroid Estimation and Phoneme Recognition" by S. Kong and B. Kosko, IEEE Transactions on Neural Networks, vol. 2, No. 1, Jan. 1991, pp. 118-124.
"A Maximum Likelihood Approach to Continuous Speech Recognition", by L. R. Bahl, et al., IEEE Transactions on Pattern Analysis and Machine Intelligence vol. PAMI-5, No. 2, Mar. 1983 pp. 179-190.
"Multonic Markov Word Models For Large Vocabulary Continuous Speech Recognition" by L. R. Bahl, et al., IEEE Transactions on Speech and Audio Processing, vol. 1, No. 3, Jul. 1993, pp. 334-343.
"Speaker Adaptation via VQ Prototype Modification" by D. Rtischev, et al., IEEE Transactions on Speech and Audio Processing, vol. 2, No. 1, Part 1, Jan. 1994 pp. 94-97.
"A Fast Approximate Acoustic Match For Large Vocabulary Speech Recognition" by L. Bahl, et al., IEEE Transactions on Speech and Audio Processing vol. 1, No. 1, Jan. 1993, pp. 59-67.
Bahl Lalit R.
De Souza Peter V.
Gopalakrishnan Ponani
Picheny Michael A.
International Business Machines - Corporation
MacDonald Allen R.
Smits Talivaldis Ivais
Speech recognition using dynamic features