Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Reexamination Certificate
2003-01-21
2009-11-10
Armstrong, Angela A (Department: 2626)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
Reexamination Certificate
active
07617104
ABSTRACT:
A method of speech recognition is provided that determines a production-related value, vocal-tract resonance frequencies in particular, for a state at a particular frame based on the production-related values associated with two preceding frames using a recursion. The production-related value is used to determine a probability distribution of the observed feature vector for the state. A probability for an observed value received for the frame is then determined from the probability distribution. Under one embodiment, the production-related value is determined using a noise-free recursive definition for the value. Use of the recursion substantially improves the decoding speed. When the decoding algorithm is applied to training data with known phonetic transcripts, forced alignment is created which improves the phone segmentation obtained from the prior art.
REFERENCES:
patent: 5806029 (1998-09-01), Buhrke et al.
patent: 5893058 (1999-04-01), Kosaka
patent: 2001/0044719 (2001-11-01), Casey
Ma, J. et al., “A mixture Linear Model With Target-Directed Dynamics For Spontaneous Speech Recognition,” 2002 IEEE International Conference on Acoustics, Speech and Signal Processing, Orlando, FL, May 13-17, 2002 pp. I-961-4 vol. 1.
Picone, J. et al., “Initial Evaluation of Hidden Dynamic Models on Conversational Speech,” 1999 IEEE International Conference on Acoustics, Speech and Signal ICASSP99, Phoenix, AZ, vol. 1, pp. 15-19.
Richards, H.B. et al., “The HDM: A Segmental Hidden Dynamic Model of Coarticulation,” 1999 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP99, Phoenix, AZ, vol. 1, pp. 357-360.
European Search Report for corresponding EPO Patent Application 04001079.5.
H.G. Hirsch and D. Pearce, “The AURORA Experimental Framework for the Performance Evaluations of Speech Recognition Systems Under Noisy Conditions,” ISCA ITRW ASR2000, Automatic Speech Recognition: Challenges for the Next Millennium (Sep. 2000).
Li Deng et al., “Large Vocabulary Continuous Speech Recognition Under Adverse Conditions,” In Proceedings of the ICSLP, vol. 3, pp. 806-809 (Oct. 2000).
Li Deng et al., “High-Performance Robust Speech Recognition Using Stereo Training Data,” In International Conference on Acoustics, Speech and Signal Processing (May 2001).
Jasha Droppo et al., “Efficient On-Line Acoustic Environment Estimation for FCDCN in A Continuous Speech Recognition System,” In International Conference on Acoustics, Speech and Signal Processing (May 2001).
P. Moreno, “Speech Recognition in Noisy Environments,” Ph.D. Thesis, Carnegie Mellong University (1996).
Brendan Frey, et al., “ALGONQUIN: Iteratin Laplace's Method to Remove Multiple Types of Noise and Channel Distortion from Log-Spectra in Robust Speech Recognition,” (2001).
A. Acero, et al., “HMM Adaptation Using Vector Taylor Series for Noisy Speech Recognition,” Proceedings ICSLP, vol. 2, pp. 869-872 (2000).
M. Afify and O. Siohan, “Sequential Noise Estimation With Optimal Forgetting for Robust Speech Recognition,” Proceedings ICASSP, vol. 1, pp. 229-232 (2001).
N. S. Kim, “Nonstationary Environment Compensation Based on Sequential Estimation,” IEEE Signal Processing Letters, vol. 5, pp. 57-60 (1998).
V. Krishnamurthy and J.B. Moore, “On-Line Estimation of Hidden Markov Model Parameters Based on the Kullback-Leibler Information Meature,” IEEE Transaction Signal Processing, vol. 41, pp. 2557-2573 (1993).
P. Moreno et al., “A Vector Taylor Series Approach for Environment-Independent Speech Recognition,” Proceedings ICASSP, vol. 1, pp. 733-736 (1996).
First Office Action in counterpart Chinese application 200410005917.7, filed Jan. 21, 2004.
Examination report for European Patent Application No. 04001079.5 filed on Jan. 20, 2004.
Deng Li
Seide Frank Torsten Bernd
Zhou Jian-Iai
Armstrong Angela A
Magee Theodore M.
Microsoft Corporation
Westman Champlin & Kelly P.A.
LandOfFree
Method of speech recognition using hidden trajectory Hidden... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method of speech recognition using hidden trajectory Hidden..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method of speech recognition using hidden trajectory Hidden... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-4080157