Distance measure in a speech recognition system for speech recog

Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition

Patent


Details

G10L 708

Patent

active

060321164

ABSTRACT:
One embodiment of a speech recognition system is organized with speech input signal preprocessing and feature extraction followed by a fuzzy matrix quantizer (FMQ). Frames of the speech input signal are represented by a vector f of line spectral pair frequencies and are fuzzy matrix quantized to respective vector entries f̂ in a codebook of the FMQ. A distance measure between f and f̂, d(f, f̂), is defined as [equation ##EQU1## not reproduced in this listing], where the constants α₁, α₂, β₁ and β₂ are set to substantially minimize quantization error, and e_i is the error power spectrum of the speech input signal and a predicted speech input signal at the ith line spectral pair frequency of the speech input signal. The speech recognition system may also include hidden Markov models and neural networks, such as a multilevel perceptron neural network, as speech classifiers.
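The quantization step in the abstract can be illustrated with a minimal sketch. It is not the patented method: the abstract's distance equation is elided in this listing, so a plain squared-error distance is swapped in as a stand-in, the matrix distance is assumed to be the sum of per-frame distances, and the membership rule is the generic fuzzy c-means formula. All function names and the fuzziness parameter `m` are hypothetical.

```python
from typing import Callable, List, Sequence

Vector = Sequence[float]   # line spectral pair frequencies for one frame
Matrix = Sequence[Vector]  # one row per frame


def squared_error(f: Vector, f_hat: Vector) -> float:
    """Stand-in distance: squared Euclidean error between two LSP vectors.

    Assumption: this is NOT the patent's distance measure, whose equation
    is not reproduced in the listing.
    """
    return sum((a - b) ** 2 for a, b in zip(f, f_hat))


def matrix_distance(
    x: Matrix,
    c: Matrix,
    d: Callable[[Vector, Vector], float] = squared_error,
) -> float:
    """Assumed matrix distance: sum of per-frame vector distances."""
    return sum(d(f, f_hat) for f, f_hat in zip(x, c))


def fuzzy_memberships(
    x: Matrix,
    codebook: Sequence[Matrix],
    m: float = 2.0,
    d: Callable[[Vector, Vector], float] = squared_error,
) -> List[float]:
    """Return one membership in [0, 1] per codebook entry, summing to 1.

    Uses the generic fuzzy c-means rule u_j = 1 / sum_k (d_j / d_k)^(1/(m-1)),
    where d_j is the distance from x to codebook entry j.
    """
    if m <= 1.0:
        raise ValueError("fuzziness m must be greater than 1")
    dists = [matrix_distance(x, c, d) for c in codebook]
    exact = [i for i, v in enumerate(dists) if v == 0.0]
    if exact:
        # An exact match takes all membership (shared if several entries match).
        return [1.0 / len(exact) if i in exact else 0.0 for i in range(len(dists))]
    exponent = 1.0 / (m - 1.0)
    return [1.0 / sum((dj / dk) ** exponent for dk in dists) for dj in dists]


if __name__ == "__main__":
    # Toy codebook: two entries, each two frames of two LSP frequencies.
    codebook = [
        [[0.1, 0.2], [0.1, 0.2]],
        [[0.3, 0.4], [0.3, 0.4]],
    ]
    frames = [[0.12, 0.22], [0.11, 0.21]]
    print(fuzzy_memberships(frames, codebook))
```

Because the distance is passed in as the parameter `d`, a different measure (for example one weighted by an error power spectrum) could replace `squared_error` without changing the membership computation.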

REFERENCES:
patent: 4383135 (1983-05-01), Scott et al.
patent: 4519094 (1985-05-01), Brown et al.
patent: 4933973 (1990-06-01), Porter
patent: 4975955 (1990-12-01), Taguchi
patent: 5031217 (1991-07-01), Nishimura
patent: 5046099 (1991-09-01), Nishimura
patent: 5185848 (1993-02-01), Aritsuka et al.
patent: 5228087 (1993-07-01), Bickerton
patent: 5255339 (1993-10-01), Fette et al.
patent: 5285522 (1994-02-01), Mueller
patent: 5313555 (1994-05-01), Kamiya
patent: 5414796 (1995-05-01), Jacobs et al.
patent: 5583888 (1996-12-01), Ono
patent: 5596679 (1997-01-01), Wang
patent: 5625747 (1997-04-01), Goldberg et al.
patent: 5696878 (1997-12-01), Ono et al.
patent: 5734793 (1998-03-01), Wang
Xydeas, C.S. Prof. and Cong, Lin "Robust Speech Recognition Using Fuzzy Matrix Quantisation, Neural Networks and Hidden Markov Models" Sep. 1996, pp. 1587-1590.
Cong, Lin, Xydeas, Costas S. Prof. and Ferwood, Anthony F.; "Combining Fuzzy Vector Quantization and Neural Network Classification for Robust Isolated Word Speech Recognition"; Singapore ICCS 1994, pp. 884-887.
Parsons, Thomas W., "Voice and Speech Processing"; McGraw-Hill, Inc., New York, 1987; pp. 170-171.
Xydeas, C.S. and Lin Cong; "Robust Speech Recognition Using Fuzzy Matrix Quantization and Neural Networks"; Proceedings of International Conference on Communication Technology; Beijing, China--ICCT '96; pp. 432-435; IEEE; New York (May 5-7, 1996).
Xydeas, C.S. and Cong, Lin; "Speech Robust Recognition Using Fuzzy Matrix Quantization, Neural Networks and Hidden Markov Models"; Proc. of EUSIPCO-96; Eighth Eur. Sig. Proc. Conf.: Theory and Appl. of Sig Proc, v3; pp. 1587-1590; Trieste, Italy (Sep. 10-13, 1996).
Cong, Lin; "A Study of Robust IWSR Systems"; PhD Thesis submitted to The University of Manchester School of Engineering, Division of Electrical Engineering; Manchester, United Kingdom; pp. 1-209. May 1996.
Waibel, Alexander; "Neural Network Approaches for Speech Recognition"; Chapter 18 of Advances in Speech Signal Processing; edited by Sadaoki Furui and M. Mohan Sondhi; Marcel Dekker, Inc.; New York, New York; 1992; pp. 555-595.
Xydeas, C. S. and Cong, L.; "Combining Neural Network Classification with Fuzzy Vector Quantization and Hidden Markov Models for Robust Isolated Word Speech Recognition"; Signal Processing VIII Theories and Applications, vol. III; Proceedings for the IEEE International Symposium on Information Theory, IEEE Press, 1995, p. 174.
Xydeas, C. S. and Cong, L. "Robust Speech Recognition in A Car Environment"; Presented at DSP95 International Conference on Digital Signal Processing, Jun. 26-28, 1995, Limassol, Cyprus; vol. 1, pp. 84-89.
Cong, Lin, Prof. C.S. Xydeas, and Anthony Ferwood; "A Study of Robust Isolated Word Speech Recognition Based on Fuzzy Methods"; Presented at EUSIPCO-94, VII European Signal Processing Conference, Sep. 13-16, 1994; Scotland, UK.; 4 pages.
Gibson, Jerry D.; "Coding, Transmission, and Storage"; Chapter 14, Speech Signal Processing, of The Electrical Engineering Handbook; Editor-in-Chief Richard C. Dorf; © 1993 by CRC Press, Inc.; pp. 279-314.
Gersho, Allen and Shihua Wang; "Vector Quantization Techniques in Speech Coding"; Chapter 2 of Advances in Speech Signal Processing; edited by Sadaoki Furui and M. Mohan Sondhi; Marcel Dekker, Inc.; New York, New York; 1992; pp. 49-84.
Kroon, Peter and Bishnu S. Atal; "Predictive Coding of Speech Using Analysis-by-Synthesis Techniques"; Chapter 5 of Advances in Speech Signal Processing; edited by Sadaoki Furui and M. Mohan Sondhi; Marcel Dekker, Inc.; New York, New York; 1992; pp. 141-164.
Honda, Masaaki and Yoshinao Shiraki; "Very Low-Bit-Rate Speech Coding"; Chapter 7 of Advances in Speech Signal Processing; edited by Sadaoki Furui and M. Mohan Sondhi; Marcel Dekker, Inc.; New York, New York; 1992; pp. 209-230.
Schroeter, Juergen and M. Mohan Sondhi; "Speech Coding Based on Physiological Models of Speech Production"; Chapter 8 of Advances in Speech Signal Processing; edited by Sadaoki Furui and M. Mohan Sondhi; Marcel Dekker, Inc.; New York, New York; 1992; pp. 231-268.
Lawrence Rabiner and Biing-Hwang Juang, "Fundamentals of Speech Recognition," Prentice Hall PTR (Englewood Cliffs, New Jersey, 1993), pp. 190-195.
