Patent
1997-06-04
1999-09-21
Hudspeth, David R.
Data processing: speech signal processing, linguistics, language
Speech signal processing
For storage or transmission
704/231, G10L 3/02, G10L 9/00, G10L 5/06
active
059566710
ABSTRACT:
The present invention includes a method of generating a set of substantially shift invariant acoustic features from an input speech signal which comprises the steps of: splitting the input speech signal into a plurality of input speech signals; respectively delaying a majority of the input speech signals by a successively incrementing time interval; respectively extracting a plurality of sets of acoustic features from the plurality of input speech signals; summing the plurality of sets of acoustic features to form a set of summed acoustic features; and dividing the set of summed acoustic features by a number equivalent to the number of sets of acoustic features summed in the summing step thereby forming a set of averaged acoustic features which are substantially shift invariant.
Further, the present invention may include a method for generating at least one substantially shift invariant speech recognition model from speech training data which comprises the steps of: inputting the speech training data a first time; extracting acoustic features from the speech training data input the first time; inputting the speech training data a plurality of times thereafter, each time respectively delaying the input speech training data by a successively incrementing time interval; respectively extracting acoustic features from each delayed speech training data input each time; and utilizing at least the acoustic features extracted in the extracting steps to form the at least one speech recognition model which is substantially shift invariant.
Still further, the present invention may include a synchrosqueezing process in the feature extraction steps.
Also, the invention contemplates implementing these processes individually, in combination with another of the processes, and a combination of all the processes.
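The two delay-based methods in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The feature extractor (`frame_features`, a log-magnitude spectrum per frame), the frame length, the hop size, the number of copies, and the delay step are all assumptions chosen for illustration, and the synchrosqueezing option is not modeled.

```python
import numpy as np

def frame_features(signal, frame_len=256, hop=128):
    """Hypothetical stand-in extractor: log-magnitude spectrum of each frame."""
    n_frames = 1 + (len(signal) - frame_len) // hop
    window = np.hanning(frame_len)
    frames = np.stack([signal[i * hop : i * hop + frame_len] * window
                       for i in range(n_frames)])
    return np.log(np.abs(np.fft.rfft(frames, axis=1)) + 1e-8)

def delay(signal, shift):
    """Delay by `shift` samples: pad zeros in front, keep the original length."""
    return np.concatenate([np.zeros(shift), signal])[: len(signal)]

def shift_averaged_features(signal, n_copies=4, step=32):
    """First method: extract features from copies delayed by 0, step, 2*step, ...
    then sum the feature sets and divide by the number of sets summed."""
    feature_sets = [frame_features(delay(signal, k * step))
                    for k in range(n_copies)]
    return sum(feature_sets) / len(feature_sets)

def delayed_training_features(signal, n_passes=4, step=32):
    """Second method: features from the undelayed pass plus each successively
    delayed pass, all of which would be used to train the recognition model."""
    return [frame_features(delay(signal, k * step)) for k in range(n_passes)]
```

Under these assumptions, `shift_averaged_features` returns one averaged feature matrix per utterance, while `delayed_training_features` returns one feature matrix per pass, so the model sees the same speech at several frame alignments.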
REFERENCES:
patent: 5623107 (1991-01-01), Ueda et al.
A Nonlinear Squeezing of the Continuous Wavelet Analysis Based on Auditory Nerve Models, Daubechies et al., Wavelets in Medicine and Biology, edited by A. Alroubi and M. Unser, CRC Press (Jul. 1995, published in Apr. 1996).
Robust Speech and Speaker Recognition Using Instantaneous Frequencies and Amplitudes Obtained with Wavelet-Derived Synchrosqueezing Measures, S. Maes, Program on Spline Functions and the Theory of Wavelets, Centre de Recherches Mathematiques, Universite de Montreal, Canada (Mar. 1996).
Ittycheriah Abraham Poovakunnel
Maes Stephane Herman
Hudspeth David R.
International Business Machines Corporation
Sax Robert Louis
Apparatus and methods for shift invariant speech recognition
Profile ID: LFUS-PAI-O-91383