Speech recognition method using time-frequency masking mechanism

Patent

Details

395 247, 395 252, G10L 506

Patent

active

054598150

ABSTRACT:
A speech recognition method in which input speech signals are converted to digital signals and then converted, frame by frame in time sequence, to cepstrum coefficients or logarithmic spectra. A dynamic cepstrum time sequence is obtained by time-frequency filtering of the cepstrum coefficients, or a masked spectrum time sequence is obtained by time-frequency masking of the logarithmic spectrum time sequence. Speech is then recognized based on the dynamic cepstrum time sequence or the masked spectrum time sequence obtained in this manner.
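The masking step in the abstract can be illustrated with a minimal sketch. It is not the patented filter: the moving-average smoothing, the function names, and the parameters `depth`, `decay`, and `gain` are all assumptions chosen for illustration. The sketch takes a time sequence of logarithmic spectra and subtracts from each frame a masking pattern built from the preceding frames, so that steady spectral components are attenuated while newly appearing ones are kept.

```python
def smooth(spectrum, width):
    """Moving average along the frequency axis; the window shrinks at the edges."""
    n = len(spectrum)
    out = []
    for k in range(n):
        lo, hi = max(0, k - width), min(n, k + width + 1)
        out.append(sum(spectrum[lo:hi]) / (hi - lo))
    return out


def masked_spectra(log_spectra, depth=3, decay=0.5, gain=0.3):
    """Return a masked spectrum time sequence (illustrative assumptions only).

    The frame j steps in the past contributes gain * decay**(j - 1) times its
    spectrum, smoothed over a frequency window that widens with j.
    """
    result = []
    for t, current in enumerate(log_spectra):
        mask = [0.0] * len(current)
        # Accumulate the masking pattern from up to `depth` preceding frames.
        for j in range(1, min(depth, t) + 1):
            weight = gain * decay ** (j - 1)
            past = smooth(log_spectra[t - j], width=j)
            mask = [m + weight * p for m, p in zip(mask, past)]
        # Subtract the pattern from the current logarithmic spectrum.
        result.append([c - m for c, m in zip(current, mask)])
    return result


# Three steady frames followed by a frame with a new peak in channel 2.
frames = [[1.0, 1.0, 1.0, 1.0]] * 3 + [[1.0, 1.0, 5.0, 1.0]]
masked = masked_spectra(frames)
```

In this toy input the steady channels shrink from frame to frame as the masking pattern builds up, while the new peak in the last frame stays well above them. A recognizer would then consume `masked` in place of the raw logarithmic spectra.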

REFERENCES:
patent: 4956865 (1990-09-01), Lennig et al.
patent: 5067158 (1991-11-01), Arjmand
patent: 5097510 (1992-03-01), Graupe
patent: 5202926 (1993-04-01), Miki
patent: 5268685 (1993-12-01), Fujiwara
S. Furui, "Speaker-Independent Isolated Word Recognition Using Dynamic Features of Speech Spectrum", IEEE Trans. ASSP-34, No. 1, pp. 52-59, (Feb. 1986).
D. Klatt, "Prediction of Perceived Phonetic Distance from Critical-Band Spectra: A First Step", Proc. ICASSP82, pp. 1278-1281, (May 1982).
B. Hanson et al., "Spectral Slope Distance Measures with Linear Prediction Analysis for Word Recognition in Noise", IEEE Trans. ASSP-35, No. 7, pp. 968-973, (Jul. 1987).
K. Aikawa et al., "Spectral Movement Function and Its Application to Speech Recognition", Proc. ICASSP88, pp. 223-226, (Apr. 1988).
E. Miyasaka, "Spatio-Temporal Characteristics of Masking of Brief Test-Tone Pulses by a Tone-Burst with Abrupt Switching Transients", vol. 39, No. 9, pp. 614-623, (1983).
J. Markel et al., "Linear Prediction of Speech", Springer-Verlag (1976).
Y. Linde et al., "An Algorithm for Vector Quantizer Design", IEEE Transactions on Communications, vol. Com-28, No. 1, pp. 84-95 (1980).
L. Baum, "An Inequality and Associated Maximization Technique in Statistical Estimation for Probabilistic Functions of Markov Processes", 3, pp. 1-8, (1972).
P. Brown, "The Acoustic-Modeling Problem in Automatic Speech Recognition", Ph. D. thesis, Carnegie-Mellon University (1987).
H. Sakoe et al., "Dynamic Programming Algorithm Optimization for Spoken Word Recognition", IEEE Trans. on Acoustics, Speech, and Signal Processing, vol. ASSP-26, No. 1, (Feb. 1978).
Lee et al., "An Overview of the Sphinx Speech Recognition System", IEEE Trans. on Acoustics, Speech, and Signal Processing, vol. 38, No. 1, pp. 35-45, (Jan. 1990).

