Method of speech recognition using variables representing...

Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C704S256000

Reexamination Certificate

active

07346510

ABSTRACT:
A method and computer-readable medium are provided that determine predicted acoustic values for a sequence of hypothesized speech units using modeled articulatory or VTR dynamics values and using the modeled relationship between the articulatory (or VTR) and acoustic values for the same speech events. Under one embodiment, the articulatory (or VTR) dynamics value depends on articulatory dynamics values at pervious time frames and articulation targets. In another embodiment, the articulatory dynamics value depends in part on an acoustic environment value such as noise or distortion. In a third embodiment, a time constant that defines the articulatory dynamics value is trained using a variety of articulation styles. By modeling the articulatory or VTR dynamics value in these manners, hyper-articulated, hypo-articulated, fast, and slow speech can be better recognized and the requirement for the training data can be reduced.

REFERENCES:
patent: 4918735 (1990-04-01), Morito et al.
patent: 4980917 (1990-12-01), Hutchins
patent: 5012519 (1991-04-01), Adlersberg et al.
patent: 5924065 (1999-07-01), Eberman et al.
patent: 6092045 (2000-07-01), Stubley et al.
patent: 6778954 (2004-08-01), Kim et al.
L. Deng, “A Dynamic, Feature-Based Approach to the Interface Between Phonology and Phonetics for Speech Modeling and Recognition,” Speech Communication, vol. 24, No. 4, pp. 299-323 (1998).
L. Deng and Z. Ma, “Spontaneous Speech Recognition Using A Statistical Coarticulatory Model for the Hidden Vocal-Tract-Resonance Dynamics,” J. Acoust. Soc. Am., vol. 108, No. 6, pp. 3036-3048 (2000).
L. Deng and J. Ma, “A Statistical Coarticulatory Model for the Hidden Vocal-Tract-Resonance Dynamics,” Proc. of Eurospeech, vol. 4, pp. 1499-1502 (Sep. 1999).
L. Deng and H. Sameti, “Transitional Speech Units and Their Representation by the Regressive Markov States: Applications to Speech Recognition,” IEEE Trans. Speech Audio Proc., vol. 4, No. 4, pp. 301-306 (Jul. 1996).
L. Deng and D. Sun, “A Statistical Approach to Automatic Speech Recognition Using the Atomic Speech Units Constructed from Overlapping Articulatory Features,” J. Acoust. Soc. Am., vol. 95, pp. 2702-2719 (1994).
Y. Gao et al., “Multistage Coarticulation Model Combining Articulatory, Formant and Cepstral Features,” Poc. ICSLP. vol. 1, pp. 25-28 (2000).
J. Ma and L. Deng, “A Path-Stack Algorithm for Optimizing Dynamic Regimes in a Statistical Hidden Dynamic Model of Speech,” Computer Speech and Language, vol. 14, pp. 101-104 (2000).
M. Ostendorf et al., “From HMMs to Segment Models: A Unified View of Stochastic Modeling for Speech Recognition,” IEEE Trans. Speech Audio Proc., vol. 4, pp. 360-378 (1996).
J. Ma and L. Deng, “Target-Directed Mixture Linear Dynamic Models for Spontaneous Speech Recognition,” IEEE Trans. Speech and Audio Processing (submitted 1999, to appear 2002).
J. Bridle et al., “The WS98 Final Report on the Dynamic Model,” http://www.clsp.jhu.edu/ws98/projects/dynamic/presentations/finalhtml/index.html, Johns Hopkins Univ. 1998).
F. -L. Chen et al., “The Structure and Its Implementation of Hidden Dynamic HMM for Mandarin Speech Recognition,” Proc. ICSLP, pp. 713-716, Denver (2002).
Li Deng and Jeff Ma, “Spontaneous speech recognition using a statistical coarticulatory model for the vocal-tract-resonance dynamics,” J. Acoust. Soc. Am. 105(5), Pt. 1, Nov. 2002.
Jeff Ma and Li Deng, “A path-stack algorithm for optimizing dynamic regimes in a statistical hidden dynamic model of speech,”Computer Speech and Language 2000, 00, 1-14.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Method of speech recognition using variables representing... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Method of speech recognition using variables representing..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method of speech recognition using variables representing... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2803833

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.