Patent
1992-05-26
1996-12-17
MacDonald, Allen R.
395 252, G10L 506
active
055862152
ABSTRACT:
The apparatus for the recognition of speech comprises an acoustic preprocessor, a visual preprocessor, and a speech classifier that operates on the acoustic and visual preprocessed data. The acoustic preprocessor comprises a log mel spectrum analyzer that produces an equal mel bandwidth log power spectrum. The visual preprocessor detects the motion of a set of fiducial markers on the speaker's face and extracts a set of normalized distance vectors describing lip and mouth movement. The speech classifier uses a multilevel time-delay neural network operating on the preprocessed acoustic and visual data to form an output probability distribution that indicates the probability of each candidate utterance having been spoken, based on the acoustic and visual data.
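The classifier stage described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: it shows a single time-delay layer (the abstract describes a multilevel network), and the time averaging, the softmax normalization, the function names, and all feature and weight values are assumptions chosen only for illustration.

```python
import math

def time_delay_layer(frames, weights, biases):
    """Apply one time-delay layer (illustrative, single level only).

    frames:  list of T feature vectors, each of length D
    weights: list of K units; each unit is a list of N delay taps,
             each tap a list of D weights
    biases:  list of K bias terms
    Returns T - N + 1 output frames, each with K sigmoid activations.
    """
    n_taps = len(weights[0])
    out = []
    for t in range(len(frames) - n_taps + 1):
        frame_out = []
        for unit, bias in zip(weights, biases):
            # Weighted sum over a window of n_taps consecutive input frames.
            s = bias
            for d, tap in enumerate(unit):
                s += sum(w * x for w, x in zip(tap, frames[t + d]))
            frame_out.append(1.0 / (1.0 + math.exp(-s)))
        out.append(frame_out)
    return out

def utterance_probabilities(frames):
    """Average activations over time, then normalize with a softmax.

    Both steps are assumptions; the abstract only states that the output
    is a probability distribution over candidate utterances.
    """
    k = len(frames[0])
    mean = [sum(f[i] for f in frames) / len(frames) for i in range(k)]
    peak = max(mean)  # subtract the maximum for numerical stability
    exps = [math.exp(m - peak) for m in mean]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical input: 4 time frames, each joining 2 acoustic values
# (log mel band powers) with 1 visual value (a normalized lip distance).
acoustic = [[0.1, 0.4], [0.3, 0.2], [0.5, 0.1], [0.2, 0.3]]
visual = [[0.9], [0.7], [0.4], [0.6]]
frames = [a + v for a, v in zip(acoustic, visual)]

# Hypothetical weights: 2 candidate utterances, a 2-frame delay window,
# 3 features per frame.
weights = [
    [[0.5, -0.2, 0.1], [0.3, 0.0, -0.4]],
    [[-0.1, 0.6, 0.2], [0.0, -0.3, 0.5]],
]
biases = [0.0, 0.1]

hidden = time_delay_layer(frames, weights, biases)
probs = utterance_probabilities(hidden)
print(probs)
```

Each unit sees the acoustic and visual features of several consecutive frames at once, which is what lets a time-delay network respond to short temporal patterns regardless of where they occur in the utterance.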
REFERENCES:
patent: 4620286 (1986-10-01), Smith et al.
patent: 4757541 (1988-07-01), Beadles
patent: 4937872 (1990-06-01), Hopfield et al.
patent: 4975960 (1990-12-01), Petajan
patent: 5163111 (1992-11-01), Baji et al.
Waibel et al., "Phoneme Recognition: Neural Networks Vs. Hidden Markov Models", IEEE ICASSP 88 Proceedings, v.1, pp. 107-110, 1988.
A. Waibel, "Modular Construction of Time-Delay Neural Networks for Speech Recognition", published in Neural Computation 1, 39-46 (1989).
Petajan, E.D. et al., "An Improved Automatic Lipreading System to Enhance Speech Recognition", ACM SIGCHI-88, 19-25 (1988).
Pentland, A., et al., "Lip Reading: Automatic Visual Recognition of Spoken Words", Processing Image Understanding and Machine Vision, Optical Society of America, Jun. 12-14 (1984).
Yuhas, B.P., et al., "Integration of Acoustic and Visual Speech Signals Using Neural Networks", Nov. 1989, IEEE Communications Magazine (1989).
Levine Earl I.
Stork David G.
Wolff Gregory J.
MacDonald Allen R.
Onka Thomas J.
Ricoh & Company, Ltd.
Ricoh Corporation
Profile ID: LFUS-PAI-O-1998823