Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission
Patent
1996-10-21
1998-07-14
Hudspeth, David R.
Data processing: speech signal processing, linguistics, language
Speech signal processing
For storage or transmission
704214, G10L 702, H03M 730
Patent
active
057818819
ABSTRACT:
A method and a device are described for classifying speech on the basis of the wavelet transformation for low-bit-rate speech coding processes. The method and the device permit a more robust classifier of speech signals for signal-matched control of speech coding processes in order to reduce the bit rate without affecting the speech quality or to increase the quality at the same bit rate. The method provides that, after segmenting the speech signal, a wavelet transformation is calculated for each frame, from which a set of parameters is determined with the help of adaptive thresholds. The parameters control a finite-state model, which subdivides the frames into shorter subframes if required, and classifies each subframe into one of several classes typical for speech coding. The speech signal is classified on the basis of the wavelet transformation for each time frame. Thus both a high time resolution (location of pulses) and frequency resolution (good mean values) can be achieved. This method and the classifier are therefore especially well suited for the control and selection of code books in a low-bit-rate speech coder. They also have a low sensitivity to background noise and low complexity.
REFERENCES:
patent: 5490170 (1996-02-01), Akagiri et al.
patent: 5495555 (1996-02-01), Swaminathan
patent: 5596676 (1997-01-01), Swaminathan et al.
Olivier Rioul and Martin Vetterli, "Wavelets and Signal Processing," IEEE Signal Processing Magazine, vol. 8, No. 4, pp. 14-38, Oct. 1991.
Stephane G. Mallat and Sifen Zhong, "Characterization of Signals from Multiscale Edges," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 14, No. 7, pp. 710-732, Jul. 1992.
Shubha Kadambe and G. Faye Bourdeaux-Bartels, "Application of the Wavelet Transform for Pitch Detection of Speech Signals," IEEE Trans. Information Theory, vol. 38, No. 2, pp. 917-924, Mar. 1992.
Joachim Stegmann, Gerhard Schroder, and Kyrill A. Fischer "Robust Classification of Speech Based on the Dyadic Wavelet Transform with Application to CELP Coding,"Proc. ICASSP 96, pp. 546-549, May, 1996.
Deutsche Telekom AG
Hudspeth David R.
Smits Talivaldis Ivars
LandOfFree
Variable-subframe-length speech-coding classes derived from wave does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Variable-subframe-length speech-coding classes derived from wave, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Variable-subframe-length speech-coding classes derived from wave will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-1894023