Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Reexamination Certificate
2006-05-23
2006-05-23
Azad, Abul K. (Department: 2654)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
C704S241000
Reexamination Certificate
active
07050975
ABSTRACT:
A method of speech recognition is provided that identifies a production-related dynamics value by performing a linear interpolation between a production-related dynamics value at a previous time and a production-related target using a time-dependent interpolation weight. The hidden production-related dynamics value is used to compute a predicted value that is compared to an observed value of acoustics to determine the likelihood of the observed acoustics given a sequence of hidden phonological units. In some embodiments, the production-related dynamics value at the previous time is selected from a set of continuous values. In addition, the likelihood of the observed acoustics given a sequence of hidden phonological units is combined with a score associated with a discrete class of production-related dynamic values at the previous time to determine a score for a current phonological state.
REFERENCES:
patent: 4980917 (1990-12-01), Hutchins
Asai, K. et al.; Dividing the Distributions of HMM and Linear Interpolation in Speech Recognition; Acoustics, Speech, and SignalProcessing, 1992. ICASSP-92.,IEEE Inter Conference on vol. 1, Mar. 23-26, 1992 pp. 29-32.
Copy of European Search Report from European Application No.: 03014848.0, filed Jun. 30, 2003.
G. Welch, G. Bishop, “An Introduction to the Kalman Filter,” SIGGRAPH 2001 Conference, Aug. 12, 2001, pp. 1-47.
J. Bridle, L. Deng, J. Picone, H. Richards, J. Ma, T. Kamm, M. Schuster, S. Pike, R. Regan, “An Investigation of Segmental Hidden Dynamic Models of Speech Coarticulation for Automatic Speech Recognition,” Report of a Project at the 1998 Workshop on Language Engineering at the Center for Language and Speech Processing, The Johns Hopkins University, 1998, pp. 1-61.
Li Deng, “Articulatory Features and Associated Production Models in Statistical Speech Recognition,” Computational Models of Speech Pattern Processing, NATO ASI Series, 1999, pp. 214-224.
L. Deng, “A Dynamic, Feature-Based Approach to the Interface Between Phonology and Phonetics for Speech Modeling and Recognition,” Speech Communication, vol. 24, No. 4, pp. 299-323 (1998).
L. Deng and Z. Ma, “Spontaneous Speech Recognition Using A Statistical Coarticulatory Model for the Hidden Vocal-Tract-Resonance Dynamics,” J. Acoust. Soc. Am., vol. 108, No. 6, pp. 3036-3048 (2000).
L. Deng and J. Ma, “A Statistical Coarticulatory Model for the Hidden Vocal-Tract-Resonance Dynamics,” Proc. of Eurospeech, vol. 4, pp. 1499-1502 (Sep. 1999).
L. Deng and K. Sameti, “Transitional Speech Units and Their Representation by the Regressive Markov States: Applications to Speech Recognition,” IEEE Trans. Speech Audio Proc., vol. 4, pp. 301-306 (Jul. 1996).
L. Deng and D. Sun, “A Statistical Approach to Automatic Speech Recognition Using the Atomic Speech Units Constructed from Overlapping Articulatory Features,” J. Acoust. Soc. Am., vol. 95, pp. 2702-2719 (1994).
Y. Gao et al., “Multistage Coarticulation Model Combining Articulatory, Formant and Cepstral Features,” Poc. ICSLP. vol. 1, pp. 25-28 (2000).
J. Ma and L. Deng, “A Path-Stack Algorithm for Optimizing Dynamic Regimes in a Statistical Hidden Dynamic Model of Speech,” Computer Speech and Language, vol. 14, pp. 101-104 (2000).
M. Ostendorf et al., “From HMMs to Segment Models: A Unified View of Stochastic Modeling for Speech Recognition,” IEEE Trans. Speech Audio Proc., vol. 4, pp. 360-378 (1996).
J. Ma and L. Deng, “Target-Directed Mixture Linear Dynamic Models for Spontaneous Speech Recognition,” IEEE Trans. Speech and Audio Processing (submitted 1999, to appear 2002).
J. Bridle et al., “The WS98 Final Report on the Dynamic Model,” http://www.clsp.jhu.edu/ws98/projects/dynamic/presentations/finalhtml/index.html, Johns Hopkins Univ. 1998).
F.-L. Chen et al., “The Structure and Its Implementation of Hidden Dynamic HMM for Mandarin Speech Recognition,” Proc. ICSLP, pp. 713-716, Denver (2002).
Feili, Chen et al., The Structure and Its Implementation of Hidden Dynamic HMM For Mandarin Speech Recognition, Proceedings ICLP 2002, Sep. 2002.
Acero Alejandro
Attias Hagai
Deng Li
Gunawardana Asela J. R.
Huang Xuedong
Azad Abul K.
Magee Theodore M.
Microsoft Corporation
Westman Champlin & Kelly P.A.
LandOfFree
Method of speech recognition using time-dependent... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method of speech recognition using time-dependent..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method of speech recognition using time-dependent... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3646946