Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Patent
1996-07-23
1998-08-11
Hudspeth, David R.
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
704236, G10L 900
Patent
active
057941912
ABSTRACT:
An improved artificial neural network for use in speech recognition is disclosed. It comprises an input layer, a hidden layer, and an output layer, each of these layers consisting of a plurality of nodal points. A set of first weighting coefficients are used between the input layer and the hidden layer which are functions of at least one of the nodal points in the hidden layer and at least one of the nodal points in the input layer; whereas, a set of second weighting coefficients, which are functions of time and at least one of the nodal points in the output, are used to correlate between the hidden layer and output layer. In a preferred embodiment, the first weighting coefficients are calculated using the following formula: ##EQU1## i is the index for nodal point in the input layer and a.sub.j, b.sub.j, and c.sub.j are all training coefficients associated with nodal pointj in the hidden layer; and the second weighting coefficients are calculated using the following formula: ##EQU2## n is the timeframe number, r is the order of an orthogonal polynomial series (.psi., 60 .sub.jkm is the m-th order training coefficient between nodal points j and k, in the hidden and output layers, respectively. The use of the two different sets of weighting coefficients allows a timeframe-based division of the speech signals, resulting in a substantial reduction of parameters required for accurate speech recognition.
REFERENCES:
patent: 5285522 (1994-02-01), Mueller
patent: 5481644 (1996-01-01), Inazumi
Pao-Chung Chang, San-Wei Sun, and Sin-Horng Chen, "Mandarin Tone Recognition by Multi-Layer Perceptron," ICASSP 90, 3-6 Apr. 1990.
W.-Y. Chen and S.-H. Chen, "Speaker-Independent Mandarin Plosive Recognition with Dynamic Features and Multilayer Perceptrons," Electronic Letters 31(4), 16 Feb. 1995.
S.-H. Hwang and S.-H. Chen, "Neural-Network-Based FO Text-to-Speech Synthesizer for Mandarin," IEE Proc.-Vis. Image Signal Process., 141(6), Dec. 1994.
S.. Chang and S.-H. Chen, "Isolated Mandarin Syllable Recognition Using Segmental features," IEE Proc.-Vis. Image Signal Process., 142(1), Feb. 1995.
Lawrence Rabiner and Biing-Hwang Juang, Fundamentals of Speech Recognition, (Prentice-Hall, Inc. Englewood Cliffs, NJ, 1993) pp. 54-89, Dec. 1993.
Hudspeth David R.
Industrial Technology Research Institute
Liauh W. Wayne
Storm Donald L.
LandOfFree
Neural network based speech recognition method utilizing spectru does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Neural network based speech recognition method utilizing spectru, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Neural network based speech recognition method utilizing spectru will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-403554