Patent
1993-08-19
1995-10-10
MacDonald, Allen R.
395 241, 395 245, 395 26, 395 261, G10L 500, G06F 1518
Patent
active
054577705
ABSTRACT:
A system and method for recognizing an utterance of a speech in which each reference pattern stored in a dictionary is constituted by a series of phonemes of a word to be recognized, each phoneme having a predetermined length of continued time and having a series of frames and a lattice point (i, j) of an i-th number phoneme at an j-th number frame having a discriminating score derived from Neural Networks for the corresponding phoneme. When the series of phonemes recognized by a phoneme recognition block is compared with each reference pattern, one i of the input series of phonemes recognized by the phoneme recognition block being calculated as a matching score as gk(i, j); ##EQU1## wherein ak(i, j) denotes an output score value of the Neural Networks of the j-th number phoneme at the j-th number frame of the reference pattern and p denoted a penalty constant to avoid an extreme shrinkage of the phonemes, a total matching score is calculated as gk (I, J), I denoting the number of frames of the input series of phonemes and J denoting the number of phonemes of the reference pattern k, and one of the reference patterns which gives a maximum matching score is output as the word recognition.
REFERENCES:
patent: 4637045 (1987-01-01), Noso et al.
patent: 4799261 (1989-01-01), Lin et al.
patent: 4937872 (1990-06-01), Hopfield et al.
patent: 4975961 (1990-12-01), Sakoe
patent: 5027406 (1991-06-01), Roberts et al.
patent: 5060278 (1991-12-01), Fukumizu
patent: 5175793 (1992-12-01), Sakamoto et al.
Phoneme-based Wnd Recognition by NN . . . Akihiro Hirai et al., IEEE 17-21 Jun. 1990.
A Hybrid NN, Dynamic programming word spotlov Zeppenfeld et al., IEEE 23-26 Mar. 1992.
A Speech Recognizer Optimally Combining Loorarms Vector Quantization, Dynamic Programming and Multi-Layer Percoh Drain Covrt et al., IEEE 23-26 Mar. 1992.
Word Recognition based in the Combination of a Sequential NN and the GPDM Discriminative training Chen et al. IEEE 1 Oct. 1991.
Takami et al., "Phoneme Recognitiion by Pairwise Discriminant TDNNs", ATR Interpreting Telephony Research Laboratories, vol. 16, pp. 677-680.
Ryohei Nakatsu, A Japanese Paper of Information Processing, vol. 24, No. 8, (Yokoshuki Electrical Communication Laboratory), pp. 984-992, 1983.
Kenichi Mori, Japanese book entitled "Pattern Recognition", Published by Shadan Hozin Electronic, Information, and Communication Society on Apr. 26, 1993, pp. 116-123.
Dayhoff, Neural Network Architectures, Chapter 5, 1990.
Lynn et al., "Introductory Digital Signal Processing with Computer Applications, FFT Processing", pp. 251-283, 1992.
Lynn et al., Introductory Digital Signal Processing with Computer Application, "The Fourier Transform Method", pp. 148-157, 1992.
Dorvil Richemond
Kabushiki Kaisha Meidensha
MacDonald Allen R.
LandOfFree
Speaker independent speech recognition system and method using n does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Speaker independent speech recognition system and method using n, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Speaker independent speech recognition system and method using n will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2316067