Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission
Patent
1996-12-19
1998-11-17
Dorvil, Richemond
Data processing: speech signal processing, linguistics, language
Speech signal processing
For storage or transmission
704219, 704220, G10L 302
Patent
active
058390987
ABSTRACT:
Coding systems that provide a perceptually improved approximation of the short-term characteristics of speech signals compared to typical coding techniques such as linear predictive analysis while maintaining enhanced coding efficiency. The invention advantageously employs a non-linear transformation and/or a spectral warping process to enhance particular short-term spectral characteristic information for respective voiced intervals of a speech signal. The non-linear transformed and/or warped spectral characteristic information is then coded, such as by linear predictive analysis to produce a corresponding coded speech signal. The use of the non-linear transformation and/or spectral warping operation of the particular spectral information advantageously causes more coding resources to be used for those spectral components that contribute greater to the perceptible quality of the corresponding synthesized speech. It is possible to employ this coding technique in a variety of speech coding techniques including, for example, vocoder and analysis-by-synthesis coding systems.
REFERENCES:
patent: Re32580 (1988-01-01), Atal et al.
patent: 3624302 (1971-11-01), Atal
patent: 4220819 (1980-09-01), Atal
patent: 4472832 (1984-09-01), Atal et al.
patent: 4827517 (1989-05-01), Atal et al.
patent: 5267317 (1993-11-01), Kleijn
patent: 5371853 (1994-12-01), Kao et al.
patent: 5481642 (1996-01-01), Shoham
patent: 5495556 (1996-02-01), Honda
patent: 5513297 (1996-04-01), Kleijn et al.
Wu, et al. "An investigation of sinusoidal speech coding" Proceedings Of Fourth International Symposium On Signal Processing And Its Applications, vol. 1, pp. 25-30 Aug. 1996.
Hicks, et al. "Pitch Invariant frequency lowering with nonuniform spectral compression" International conference On Acoustics, Speech and Signal Processing, vol. 1, pp. 121-124 (1981).
Nelson, "The Mellin-wavelet transform" International Conference On Acoustics, Speech, And Signal Processing, vol. 2, pp. 9-12 (1995).
B. Atal, et al. "Stochastic Coding of Speech Signals at Very Low Bit Rates", Proc IEEE Int. Conf. Comm., p. 48.1 (May 1984).
M. Schroeder et al., "Code-Excited Linear Predictive (CELP): High Quality Speech at Very Low Bit Rates", Proc. IEEE Int. Conf. ASSP., pp. 937-940 (1985).
P. Kroon et al., "A Class of Analysis-by-Synthesis Predictive Coders for High-Quality Speech Coding at Rate Between 4.8 and 16 KB/s", IEEE J. on Sel. Areas in Comm., SAC-6(2), pp. 353-363 (Feb. 1988).
L.R. Rabiner et al., Digital Processing of Speech Signals, pp. 150-157, sects. 6.0-6.1, pp. 250-282, 372-378, 404-407, 447-450 (Prentice-Hall, New Jersey, 1978).
Laroia Rajiv
Yeo Boon-Lock
Dorvil Richemond
Finston Martin I.
Lucent Technologies - Inc.
Rudnick Robert E.
LandOfFree
Speech coder methods and systems does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Speech coder methods and systems, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Speech coder methods and systems will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-896936