Neural network acoustic and visual speech recognition system tra

Image analysis – Histogram processing – For setting a threshold

Patent

Details

395 239, 382156, G01L 506, G01L 900

active

056218583

ABSTRACT:
The apparatus for the recognition of speech includes an acoustic preprocessor, a visual preprocessor, and a speech classifier that operates on the acoustic and visual preprocessed data. The acoustic preprocessor comprises a log mel spectrum analyzer that produces an equal mel bandwidth log power spectrum. The visual preprocessor detects the motion of a set of fiducial markers on the speaker's face and extracts a set of normalized distance vectors describing lip and mouth movement. The speech classifier uses a multilevel time-delay neural network operating on the preprocessed acoustic and visual data to form an output probability distribution that indicates the probability of each candidate utterance having been spoken, based on the acoustic and visual data. The training system includes the speech recognition apparatus and a control processor with an associated memory. Noisy acoustic input training data, together with visual data, is used to generate acoustic and visual feature training vectors for processing by the speech classifier. The control processor adjusts the synaptic weights of the speech classifier based on the noisy input training data and exemplar output vectors, producing a robustly trained classifier based on the visual counterpart of the Lombard effect.
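The processing chain described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the mel formula, the tanh nonlinearity, and all function and parameter names (equal_mel_band_edges, log_band_power, normalized_distances, time_delay_layer, softmax) are assumptions made for illustration, since the abstract does not specify them.

```python
import math

def hz_to_mel(f_hz):
    # A commonly used mel-scale formula (assumption; the abstract gives none).
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)

def mel_to_hz(mel):
    # Inverse of hz_to_mel.
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)

def equal_mel_band_edges(f_lo, f_hi, n_bands):
    """Band edges in Hz such that every band has the same width in mels."""
    lo, hi = hz_to_mel(f_lo), hz_to_mel(f_hi)
    step = (hi - lo) / n_bands
    return [mel_to_hz(lo + i * step) for i in range(n_bands + 1)]

def log_band_power(power, bin_hz, edges, floor=1e-10):
    """Sum the power-spectrum bins that fall in each band, then take the log."""
    out = []
    for lo, hi in zip(edges[:-1], edges[1:]):
        total = sum(p for k, p in enumerate(power) if lo <= k * bin_hz < hi)
        out.append(math.log(max(total, floor)))  # floor avoids log(0)
    return out

def normalized_distances(markers, pairs, ref_pair):
    """Distances between fiducial-marker pairs, divided by a reference distance
    so the features do not depend on how far the speaker is from the camera."""
    def dist(i, j):
        return math.dist(markers[i], markers[j])
    ref = dist(*ref_pair)
    return [dist(i, j) / ref for i, j in pairs]

def time_delay_layer(frames, weights, bias):
    """One time-delay layer: each unit sees a window of consecutive frames.
    weights[u][d][k] is the weight of unit u for delay d and input feature k."""
    delays = len(weights[0])
    n_feat = len(frames[0])
    out = []
    for t in range(len(frames) - delays + 1):
        row = []
        for w_u, b_u in zip(weights, bias):
            s = b_u + sum(w_u[d][k] * frames[t + d][k]
                          for d in range(delays) for k in range(n_feat))
            row.append(math.tanh(s))  # tanh is an assumed nonlinearity
        out.append(row)
    return out

def softmax(scores):
    """Turn per-utterance scores into a probability distribution."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]
```

In such a sketch, each input frame would be the concatenation of one log_band_power vector and one normalized_distances vector, and the final layer's activations, summed over time, would be passed to softmax to give one probability per candidate utterance.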

REFERENCES:
patent: 4620286 (1986-10-01), Smith et al.
patent: 4757541 (1988-07-01), Beadles
patent: 4937872 (1990-06-01), Hopfield et al.
patent: 4975960 (1990-12-01), Petajan
patent: 5027406 (1991-06-01), Roberts et al.
patent: 5163111 (1992-11-01), Baji et al.
patent: 5175793 (1992-12-01), Sakamoto et al.
Waibel, A., "Modular Construction of Time-Delay Neural Networks for Speech Recognition," Neural Computation, 1, pp. 39-46 (1989).
Petajan, E., et al., "An Improved Automatic Lipreading System to Enhance Speech Recognition," ACM SIGCHI-88, pp. 19-25 (1988).
Pentland, A., et al., "Lip Reading: Automatic Visual Recognition of Spoken Words," Proc. Image Understanding and Machine Vision, Optical Society of America, pp. 1-9 (Jun. 12-14, 1989).
Yuhas, B. P., et al., "Integration of Acoustic and Visual Speech Signals Using Neural Networks," IEEE Communications Magazine, pp. 65-71 (Nov. 1989).
Waibel, A., et al., "Phoneme Recognition: Neural Networks vs. Hidden Markov Models," IEEE ICASSP88 Proceedings, vol. 1, pp. 107-110 (1988).
Sejnowski, T. J., et al., "Combining Visual and Acoustic Speech Signals with a Neural Network Improves Intelligibility," Advances in Neural Information Processing Systems 2, 8 pgs. (undated).
Stork, et al., "Neural Net. Lipreading System for Improved Speech Recognition," IEEE (Jun. 7-11, 1992).
Hochberg, et al., "An Animated Display of Tongue, Lip and Jaw Movements During Speech," IEEE (Feb. 1-5, 1992).
