Patent
1997-10-27
2000-05-23
Hudspeth, David R.
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
704/222, 704/256, 704/251, G10L 9/00
active
060675152
ABSTRACT:
A speech recognition system uses both split matrix and split vector quantizers as front ends to a second-stage speech classifier, such as hidden Markov models (HMMs), to, for example, use processing resources efficiently and improve speech recognition performance. Fuzzy split matrix quantization (FSMQ) exploits the "evolution" of the speech short-term spectral envelopes as well as frequency domain information, whereas fuzzy split vector quantization (FSVQ) operates primarily on frequency domain information. Time domain information may be substantially limited, which may introduce error into the matrix quantization, and the FSVQ may provide error compensation. Additionally, acoustic noise may affect particular frequency domain subbands. The system also, for example, exploits this localized noise by allocating enhanced processing technology to noise-affected input signal parameters so as to minimize noise influence. The enhanced processing technology includes a weighted line spectral pair (LSP) and signal-energy-related distance measure, used both in the Linde-Buzo-Gray (LBG) training algorithm and during recognition. Multiple codebooks may also be combined into a single codebook each for split matrix and split vector quantization, lowering the demand on processing resources.
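The split quantization idea in the abstract can be illustrated with a minimal Python sketch. It is not the patented method: the function names, the split boundaries, the weights, and the codebooks are all hypothetical, and the weighted squared Euclidean distance below only stands in for the weighted LSP and signal-energy-related distance measure, whose exact form the abstract does not give. The membership function follows the standard fuzzy c-means formula applied to squared distances, which is an assumption about what "fuzzy" quantization means here.

```python
from math import fsum


def weighted_sq_dist(x, c, w):
    """Weighted squared Euclidean distance between vectors x and c."""
    return fsum(wi * (xi - ci) ** 2 for xi, ci, wi in zip(x, c, w))


def split_vq(vector, splits, codebooks, weights):
    """Quantize each sub-vector against its own codebook.

    splits    -- (start, stop) index pairs, one per sub-band (hypothetical)
    codebooks -- one list of codewords per split
    weights   -- per-coefficient weights, same length as vector; larger
                 weights emphasize coefficients in the distance
    Returns the index of the nearest codeword for each split.
    """
    indices = []
    for (start, stop), book in zip(splits, codebooks):
        sub_x = vector[start:stop]
        sub_w = weights[start:stop]
        dists = [weighted_sq_dist(sub_x, cw, sub_w) for cw in book]
        indices.append(min(range(len(book)), key=dists.__getitem__))
    return indices


def fuzzy_memberships(sq_dists, fuzziness=2.0):
    """Fuzzy c-means style memberships from squared distances.

    The memberships sum to 1; fuzziness must be greater than 1.
    """
    zeros = sum(1 for d in sq_dists if d == 0.0)
    if zeros:
        # Exact match: membership is shared among zero-distance codewords.
        return [1.0 / zeros if d == 0.0 else 0.0 for d in sq_dists]
    exponent = 1.0 / (fuzziness - 1.0)
    inverse = [(1.0 / d) ** exponent for d in sq_dists]
    total = fsum(inverse)
    return [v / total for v in inverse]


if __name__ == "__main__":
    vector = [0.1, 0.2, 0.8, 0.9]
    splits = [(0, 2), (2, 4)]
    codebooks = [
        [[0.1, 0.25], [0.0, 0.0]],
        [[0.5, 0.5], [0.8, 1.0]],
    ]
    weights = [1.0, 1.0, 2.0, 2.0]
    print(split_vq(vector, splits, codebooks, weights))  # prints [0, 1]
```

Because each split has its own codebook and its own weights, a sub-band that is known to be noisy can be given a different weighting or a larger codebook without changing how the other sub-bands are handled, which is the resource-allocation point the abstract makes.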
REFERENCES:
patent: 4383135 (1983-05-01), Scott et al.
patent: 4519094 (1985-05-01), Brown et al.
patent: 4933973 (1990-06-01), Porter
patent: 4975955 (1990-12-01), Taguchi
patent: 5031217 (1991-07-01), Nishimura
patent: 5046099 (1991-09-01), Nishimura
patent: 5185848 (1993-02-01), Aritsuka et al.
patent: 5228087 (1993-07-01), Bickerton
patent: 5255339 (1993-10-01), Fette et al.
patent: 5285522 (1994-02-01), Mueller
patent: 5313555 (1994-05-01), Kamiya
patent: 5414796 (1995-05-01), Jacobs et al.
patent: 5583888 (1996-12-01), Ono
patent: 5596679 (1997-01-01), Wang
patent: 5625747 (1997-04-01), Goldberg et al.
patent: 5696878 (1997-12-01), Ono et al.
patent: 5734793 (1998-03-01), Wang
Lin Cong "A Study of Robust IWSR Systems", May 1996.
Xydeas, C.S. Prof. and Cong, Lin "Robust Speech Recognition Using Fuzzy Matrix Quantisation, Neural Networks and Hidden Markov Models" 1996 pp. 1587-1590.
Cong, Lin, Xydeas, Costas S. Prof. and Ferwood, Anthony F. Combining Fuzzy Vector Quantization and Neural Network Classification for Robust Isolated Word Speech Recognition: Singapore ICCS 1994, pp. 884-887.
Parsons, Thomas W.; "Voice and Speech Processing"; McGraw-Hill, Inc., New York, 1987, pp. 170-171.
Xydeas, C.S. and Lin Cong; "Robust Speech Recognition Using Fuzzy Matrix Quantization and Neural Networks"; Proceedings of International Conference on Communication Technology; Beijing, China--ICCT '96; pp. 432-435; IEEE; New York (May 5-7, 1996).
Cong, Lin; "A Study of Robust IWSR Systems"; PhD Thesis submitted to The University of Manchester School of Engineering, Division of Electrical Engineering; Manchester, United Kingdom; pp. 1-209. May 1996.
Waibel, Alexander; "Neural Network Approaches for Speech Recognition"; Chapter 18 of Advances in Speech Signal Processing; edited by Sadaoki Furui and M. Mohan Sondhi; Marcel Dekker, Inc.; New York, New York; 1992; pp. 555-595.
Xydeas, C. S. and Cong, L.; "Combining Neural Network Classification with Fuzzy Vector Quantization and Hidden Markov Models for Robust Isolated Word Speech Recognition"; Signal Processing VIII Theories and Applications, vol. III; Proceedings of the IEEE International Symposium on Information Theory, IEEE Press, 1995, p. 174.
Xydeas, C. S. and Cong, L.; "Robust Speech Recognition in A Car Environment"; Presented at DSP95 International Conference on Digital Signal Processing, Jun. 26-28, 1995, Limassol, Cyprus; vol. 1, pp. 84-89.
Cong, Lin, Prof. C.S. Xydeas, and Anthony Ferwood; "A Study of Robust Isolated Word Speech Recognition Based on Fuzzy Methods"; Presented at EUSIPCO-94, VII European Signal Processing Conference, Sep. 13-16, 1994; Scotland, UK.; 4 pages.
Gibson, Jerry D.; "Coding, Transmission, and Storage"; Chapter 14, Speech Signal Processing, of The Electrical Engineering Handbook; Editor-in-Chief Richard C. Dorf; ©1993 by CRC Press, Inc.; pp. 279-314.
Gersho, Allen and Shihua Wang; "Vector Quantization Techniques in Speech Coding"; Chapter 2 of Advances in Speech Signal Processing; edited by Sadaoki Furui and M. Mohan Sondhi; Marcel Dekker, Inc.; New York, New York; 1992; pp. 49-84.
Kroon, Peter and Bishnu S. Atal; "Predictive Coding of Speech Using Analysis-by-Synthesis Techniques"; Chapter 5 of Advances in Speech Signal Processing; edited by Sadaoki Furui and M. Mohan Sondhi; Marcel Dekker, Inc.; New York, New York; 1992, pp. 141-164.
Honda, Masaaki and Yoshinao Shiraki; "Very Low-Bit-Rate Speech Coding"; Chapter 7 of Advances in Speech Signal Processing; edited by Sadaoki Furui and M. Mohan Sondhi; Marcel Dekker, Inc.; New York, New York; 1992; pp. 209-230.
Schroeter, Juergen and M. Mohan Sondhi; "Speech Coding Based on Physiological Models of Speech Production"; Chapter 8 of Advances in Speech Signal Processing; edited by Sadaoki Furui and M. Mohan Sondhi; Marcel Dekker, Inc.; New York, New York; 1992, pp. 231-268.
Lawrence Rabiner and Biing-Hwang Juang, "Fundamentals of Speech Recognition," Prentice Hall PTR (Englewood Cliffs, New Jersey, 1993), pp. 190-195.
Asghar Safdar M.
Cong Lin
Abebe Daniel
Advanced Micro Devices, Inc.
Chambers Kent B.
Hudspeth David R.