Patent
1993-06-21
1995-10-17
MacDonald, Allen R.
395 247, 395 252, G10L 506
Patent
active
054598150
ABSTRACT:
A speech recognition method in which input speech signals are converted to digital signals and then time sequentially converted to cepstrum coefficients or logarithmic spectra. Dynamic spectrum time sequence is obtained by time frequency filtering of cepstrum coefficients, or masked spectrum time sequence is obtained by time frequency masking of the logarithmic vector time sequence. Based on the dynamic cepstrum time sequence or masked spectrum time sequence obtained in this manner, speech is recognized.
REFERENCES:
patent: 4956865 (1990-09-01), Lennig et al.
patent: 5067158 (1991-11-01), Arjmand
patent: 5097510 (1992-03-01), Graupe
patent: 5202926 (1993-04-01), Miki
patent: 5268685 (1993-12-01), Fujiwara
S. Furui, "Speaker-Independent Isolated Word Recognition Using Dynamic Features of Speech Spectrum", IEEE Trans., ASSP-34, No. 1, pp. 52-59, (1986-2).
D. Klatt, "Prediction of Perceived Phonetic Distance from Critical-Band Spectra: A First Step", Proc. ICASSP82, pp. 1278-1281, (May 1982).
B. Hanson et al., "Spectral Slope Distance Measures with Liner Prediction Analysis for Word Recognition in Noise", IEEE Trans. ASSP-35, No. 7, pp. 968-973, (Jul. 1987).
K. Aikawa et al., "Spectral Movement Function and Its Application to Speech Recognition", Proc. ICASSP88, 223-226, (Apr. 1988).
E. Miyasaka, "Spatio-Temporal Characteristics of Masking of Brief Test-Tone Pulses by a Tone-Burst with Abrupt Switching Transients", vol. 39, No. 9, pp. 614-623, (1983).
J. Markel et al., "Linear Prediction of Speech", Spriinger-Verlag (1976).
Y. Linde et al., "An Algorithm for Vector Quantizer Design", IEEE Transactions on Communications, vol. Com-28, No. 1, pp. 84-95 (1980).
L. Baum, "An Inequality and Associated Maximization Technique in Statistical Estimation for Probabilistic Functions of Markov Processes", 3, pp. 1-8, (1972).
P. Brown, "The Acoustic-Modeling Problem in Automatic Speech Recognition", Ph. D. thesis, Carnegie-Mellon University (1987).
H. Sakoe et al., "Dynamic Programming Algorithm Optimization for Spoken Word Recognition", IEEE Trans. on Acoustics. Speech, and Signal Processing, vol. ASSp-26, No. 1, (1978-Feb.).
Lee et al, "An Overview of the Sphinx Speech Recognition System" IEEE Trans. on Accoustics, Speech, and Signal Processing vol. 38, No. 1, Jan. 1990, pp. 35-45.
Aikawa Kiyoaki
Kawahara Hideki
Tohkura Yoh'Ichi
ATR Auditory and Visual Perception Research Laboratories
MacDonald Allen R.
Onka Thomas
LandOfFree
Speech recognition method using time-frequency masking mechanism does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Speech recognition method using time-frequency masking mechanism, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Speech recognition method using time-frequency masking mechanism will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-604393