Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Patent
1997-06-27
1999-12-14
Hudspeth, David R.
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
704256, G10L 708
Patent
active
060030038
ABSTRACT:
In one embodiment, a speech recognition system is organized with a fuzzy matrix quantizer with a single codebook representing u codewords. The single codebook is designed with entries from u codebooks which are designed with respective words at multiple signal to noise ratio levels. Such entries are, in one embodiment, centroids of clustered training data. The training data is, in one embodiment, derived from line spectral frequency pairs representing respective speech input signals at various signal to noise ratios. The single codebook trained in this manner provides a codebook for a robust front end speech processor, such as the fuzzy matrix quantizer, for training a speech classifier such as a u hidden Markov models and a speech post classifier such as a neural network. In one embodiment, a fuzzy Viterbi algorithm is used with the hidden Markov models to describe the speech input signal probabilistically.
REFERENCES:
patent: 4933973 (1990-06-01), Porter
patent: 5031217 (1991-07-01), Nishimura
patent: 5046099 (1991-09-01), Nishimura
patent: 5185848 (1993-02-01), Aritsuka et al.
patent: 5414796 (1995-05-01), Jacobs et al.
patent: 5583888 (1996-12-01), Ono
C. S. Xydeas and Lin Cong, "Robust Speech Recognition using Fuzzy Matrix Quantization, Neural Networks and Hidden Markov Models," Proc. of EUSIPCO-96, Eighth Eur. Sig. Proc. Conf.: Theor and Appl of Sig Proc, v3, p. 1587-90, Trieste, Italy (Sep. 10-13, 1996).
C. S. Xydeas and Lin Cong, "Robust Speech Recognition using Fuzzy Matrix Quantization and Neural Networks," Proceedings of International Conference on Communication Technology, Beijing, China--ICCT '96, pp. 432-435, IEEE, New York (May 5-7, 1996).
Thomas W. Parson, "Voice and Speech Processing," McGraw-Hill, Inc., New York, 1987, pp. 170-171.
Cong, Lin; "A Study of Robust IWSR Systems"; PhD Thesis submitted to The University of Manchester School of Engineering, Division of Electrical Engineering; Manchester, United Kingdom; pp. 1-209. May 1996.
Waibel, Alexander; "Neural Network Approaches for Speech Recognition"; Chapter 18 of Advances in Speech Signal Processing; edited by Sadaoki Furui and M. Mohan Sondhi; Marcel Dekker, Inc.; New York, New York; 1992; pp. 555-595.
Xydeas, C. S. and Cong, L.; "Combining Neural Network Classification with Fuzzy Vector Quantization and Hidden Markov Models for Robust Isolated Word Speech Recognition"; Signal Processing VIII Theories and Applications, vol. III; Proceedings of the IEEE International Symposium on Information Theory, IEEE Press, 1995, p. 174.
Xydeas, C.S. and Cong, L.; "Robust Speech Recognition in A Car Environment"; Presented at DSP95 International Conference on Digital Signal Processing, Jun. 26-28, 1995, Limassol, Cyprus; vol. 1, pp. 84-89.
Cong, Lin, Prof. C.S. Xydeas, and Anthony Ferwood; "A Study of Robust Isolated Word Speech Recognition Based on Fuzzy Methods"; Presented at EUSIPCO-94, VII European Signal Processing Conference, Sep. 13-16, 1994; Scotland, UK.; 4 pages.
Gibson, Jerry D.; "Coding, Transmission, and Storage"; Chapter 14, Speech Signal Processing, ofThe Electrical Engineering Handbook; Editor-in-Chief Richard C. Dor; .COPYRGT.1993 by CRC Press, Inc.; pp. 2779-314.
Gersho, Allen and Shihua Wang; "Vector Quantization Techniques in Speech Coding"; Chapter 2 of Advances in Speech Signal Processing; edited by Sadaoki Furui and M. Mohan Sondhi; Marcel Dekker, Inc.; New York, New York; 1992; pp. 49-84.
Kroon, Peter and Bishnu S. Atal; "Predictive Coding of Speech Using Analysis-by-Synthesis Techniques"; Chapter 5 of Advances in Speech Signal Processing; edited by Sadaoki Furui and M. Mohan Sondhi; Marcel Dekker, Inc.; New York, New York; 1992; pp. 141-164.
Honda, Masaaki and Yoshinao Shiraki; "Very Low-Bit-Rate Speech Coding"; Chapter 7 of Advances in Speech Signal Processing; edited by Sadaoki Furui and M. Mohan Sondhi; Marcel Dekker, Inc.; New York, New York; 1992; pp. 209-230.
Schroeter, Juergen and M. Mohan Sondhi; "Speech Coding Based on Physiological Models of Speech Production"; Chapter 8 of Advances in Speech Signal Processing; edited by Sadaoki Furui and M. Mohan Sondhi; Marcel Dekker, Inc.; New York, New York; 1992; pp. 231-268.
Asghar Safdar M.
Cong Lin
Advanced Micro Devices , Inc.
Chambers Kent B.
Hudspeth David R.
Storm Donald L.
LandOfFree
Speech recognition system having a quantizer using a single robu does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Speech recognition system having a quantizer using a single robu, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Speech recognition system having a quantizer using a single robu will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-873370