Method of speech recognition using time-dependent...

Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C704S241000

Reexamination Certificate

active

07050975

ABSTRACT:
A method of speech recognition is provided that identifies a production-related dynamics value by performing a linear interpolation between a production-related dynamics value at a previous time and a production-related target using a time-dependent interpolation weight. The hidden production-related dynamics value is used to compute a predicted value that is compared to an observed value of acoustics to determine the likelihood of the observed acoustics given a sequence of hidden phonological units. In some embodiments, the production-related dynamics value at the previous time is selected from a set of continuous values. In addition, the likelihood of the observed acoustics given a sequence of hidden phonological units is combined with a score associated with a discrete class of production-related dynamic values at the previous time to determine a score for a current phonological state.

REFERENCES:
patent: 4980917 (1990-12-01), Hutchins
Asai, K. et al.; Dividing the Distributions of HMM and Linear Interpolation in Speech Recognition; Acoustics, Speech, and SignalProcessing, 1992. ICASSP-92.,IEEE Inter Conference on vol. 1, Mar. 23-26, 1992 pp. 29-32.
Copy of European Search Report from European Application No.: 03014848.0, filed Jun. 30, 2003.
G. Welch, G. Bishop, “An Introduction to the Kalman Filter,” SIGGRAPH 2001 Conference, Aug. 12, 2001, pp. 1-47.
J. Bridle, L. Deng, J. Picone, H. Richards, J. Ma, T. Kamm, M. Schuster, S. Pike, R. Regan, “An Investigation of Segmental Hidden Dynamic Models of Speech Coarticulation for Automatic Speech Recognition,” Report of a Project at the 1998 Workshop on Language Engineering at the Center for Language and Speech Processing, The Johns Hopkins University, 1998, pp. 1-61.
Li Deng, “Articulatory Features and Associated Production Models in Statistical Speech Recognition,” Computational Models of Speech Pattern Processing, NATO ASI Series, 1999, pp. 214-224.
L. Deng, “A Dynamic, Feature-Based Approach to the Interface Between Phonology and Phonetics for Speech Modeling and Recognition,” Speech Communication, vol. 24, No. 4, pp. 299-323 (1998).
L. Deng and Z. Ma, “Spontaneous Speech Recognition Using A Statistical Coarticulatory Model for the Hidden Vocal-Tract-Resonance Dynamics,” J. Acoust. Soc. Am., vol. 108, No. 6, pp. 3036-3048 (2000).
L. Deng and J. Ma, “A Statistical Coarticulatory Model for the Hidden Vocal-Tract-Resonance Dynamics,” Proc. of Eurospeech, vol. 4, pp. 1499-1502 (Sep. 1999).
L. Deng and K. Sameti, “Transitional Speech Units and Their Representation by the Regressive Markov States: Applications to Speech Recognition,” IEEE Trans. Speech Audio Proc., vol. 4, pp. 301-306 (Jul. 1996).
L. Deng and D. Sun, “A Statistical Approach to Automatic Speech Recognition Using the Atomic Speech Units Constructed from Overlapping Articulatory Features,” J. Acoust. Soc. Am., vol. 95, pp. 2702-2719 (1994).
Y. Gao et al., “Multistage Coarticulation Model Combining Articulatory, Formant and Cepstral Features,” Poc. ICSLP. vol. 1, pp. 25-28 (2000).
J. Ma and L. Deng, “A Path-Stack Algorithm for Optimizing Dynamic Regimes in a Statistical Hidden Dynamic Model of Speech,” Computer Speech and Language, vol. 14, pp. 101-104 (2000).
M. Ostendorf et al., “From HMMs to Segment Models: A Unified View of Stochastic Modeling for Speech Recognition,” IEEE Trans. Speech Audio Proc., vol. 4, pp. 360-378 (1996).
J. Ma and L. Deng, “Target-Directed Mixture Linear Dynamic Models for Spontaneous Speech Recognition,” IEEE Trans. Speech and Audio Processing (submitted 1999, to appear 2002).
J. Bridle et al., “The WS98 Final Report on the Dynamic Model,” http://www.clsp.jhu.edu/ws98/projects/dynamic/presentations/finalhtml/index.html, Johns Hopkins Univ. 1998).
F.-L. Chen et al., “The Structure and Its Implementation of Hidden Dynamic HMM for Mandarin Speech Recognition,” Proc. ICSLP, pp. 713-716, Denver (2002).
Feili, Chen et al., The Structure and Its Implementation of Hidden Dynamic HMM For Mandarin Speech Recognition, Proceedings ICLP 2002, Sep. 2002.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Method of speech recognition using time-dependent... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Method of speech recognition using time-dependent..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method of speech recognition using time-dependent... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3646946

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.