Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Reexamination Certificate
2008-03-18
2008-03-18
Azad, Abul K. (Department: 2626)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
C704S256000
Reexamination Certificate
active
07346510
ABSTRACT:
A method and computer-readable medium are provided that determine predicted acoustic values for a sequence of hypothesized speech units using modeled articulatory or VTR dynamics values and using the modeled relationship between the articulatory (or VTR) and acoustic values for the same speech events. Under one embodiment, the articulatory (or VTR) dynamics value depends on articulatory dynamics values at pervious time frames and articulation targets. In another embodiment, the articulatory dynamics value depends in part on an acoustic environment value such as noise or distortion. In a third embodiment, a time constant that defines the articulatory dynamics value is trained using a variety of articulation styles. By modeling the articulatory or VTR dynamics value in these manners, hyper-articulated, hypo-articulated, fast, and slow speech can be better recognized and the requirement for the training data can be reduced.
REFERENCES:
patent: 4918735 (1990-04-01), Morito et al.
patent: 4980917 (1990-12-01), Hutchins
patent: 5012519 (1991-04-01), Adlersberg et al.
patent: 5924065 (1999-07-01), Eberman et al.
patent: 6092045 (2000-07-01), Stubley et al.
patent: 6778954 (2004-08-01), Kim et al.
L. Deng, “A Dynamic, Feature-Based Approach to the Interface Between Phonology and Phonetics for Speech Modeling and Recognition,” Speech Communication, vol. 24, No. 4, pp. 299-323 (1998).
L. Deng and Z. Ma, “Spontaneous Speech Recognition Using A Statistical Coarticulatory Model for the Hidden Vocal-Tract-Resonance Dynamics,” J. Acoust. Soc. Am., vol. 108, No. 6, pp. 3036-3048 (2000).
L. Deng and J. Ma, “A Statistical Coarticulatory Model for the Hidden Vocal-Tract-Resonance Dynamics,” Proc. of Eurospeech, vol. 4, pp. 1499-1502 (Sep. 1999).
L. Deng and H. Sameti, “Transitional Speech Units and Their Representation by the Regressive Markov States: Applications to Speech Recognition,” IEEE Trans. Speech Audio Proc., vol. 4, No. 4, pp. 301-306 (Jul. 1996).
L. Deng and D. Sun, “A Statistical Approach to Automatic Speech Recognition Using the Atomic Speech Units Constructed from Overlapping Articulatory Features,” J. Acoust. Soc. Am., vol. 95, pp. 2702-2719 (1994).
Y. Gao et al., “Multistage Coarticulation Model Combining Articulatory, Formant and Cepstral Features,” Poc. ICSLP. vol. 1, pp. 25-28 (2000).
J. Ma and L. Deng, “A Path-Stack Algorithm for Optimizing Dynamic Regimes in a Statistical Hidden Dynamic Model of Speech,” Computer Speech and Language, vol. 14, pp. 101-104 (2000).
M. Ostendorf et al., “From HMMs to Segment Models: A Unified View of Stochastic Modeling for Speech Recognition,” IEEE Trans. Speech Audio Proc., vol. 4, pp. 360-378 (1996).
J. Ma and L. Deng, “Target-Directed Mixture Linear Dynamic Models for Spontaneous Speech Recognition,” IEEE Trans. Speech and Audio Processing (submitted 1999, to appear 2002).
J. Bridle et al., “The WS98 Final Report on the Dynamic Model,” http://www.clsp.jhu.edu/ws98/projects/dynamic/presentations/finalhtml/index.html, Johns Hopkins Univ. 1998).
F. -L. Chen et al., “The Structure and Its Implementation of Hidden Dynamic HMM for Mandarin Speech Recognition,” Proc. ICSLP, pp. 713-716, Denver (2002).
Li Deng and Jeff Ma, “Spontaneous speech recognition using a statistical coarticulatory model for the vocal-tract-resonance dynamics,” J. Acoust. Soc. Am. 105(5), Pt. 1, Nov. 2002.
Jeff Ma and Li Deng, “A path-stack algorithm for optimizing dynamic regimes in a statistical hidden dynamic model of speech,”Computer Speech and Language 2000, 00, 1-14.
Azad Abul K.
Magee Theodore M.
Microsoft Corporation
Westman Champlin & Kelly P.A.
LandOfFree
Method of speech recognition using variables representing... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method of speech recognition using variables representing..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method of speech recognition using variables representing... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2803833