Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission
Patent
1997-04-02
2000-01-11
Hudspeth, David R.
Data processing: speech signal processing, linguistics, language
Speech signal processing
For storage or transmission
704219, 704229, G10L 302, G10L 900
Patent
active
060146214
ABSTRACT:
A speech compression system called "Transform Predictive Coding", or TPC, provides for encoding 7 kHz wideband speech (16 kHz sampling) at a target bit-rate range of 16 to 32 kb/s (1 to 2 bits/sample). The system uses short-term and long-term prediction to remove the redundancy in speech. A prediction residual is transformed and coded in the frequency domain to take advantage of knowledge in human auditory perception. The TPC coder uses only open-loop quantization and therefore has a fairly low complexity. The speech quality of TPC is essentially transparent at 32 kb/s, very good at 24 kb/s, and acceptable at 16 kb/s.
REFERENCES:
patent: Re32580 (1988-01-01), Atal et al.
patent: 5081681 (1992-01-01), Hardwick
patent: 5127053 (1992-06-01), Koch
patent: 5311561 (1994-05-01), Akagiri
patent: 5314457 (1994-05-01), Jeutter et al.
patent: 5327520 (1994-07-01), Chen
patent: 5450522 (1995-09-01), Hermansky et al.
patent: 5469474 (1995-11-01), Kitabatake
patent: 5475789 (1995-12-01), Nishiguchi
patent: 5533052 (1996-07-01), Bhaskar
W.W. Chang et. al., "Audio Coding Masking-Threshold Adapted Perceptual Filter," Proc. IEEE Workshop Speech Coding for Telecomm., pp. 9-10, Oct. 1993.
L.R. Rabiner et. al., Digital Processing of Speech Signals, Prentice-Hall, Inc., Englewood Cliffs, NJ, 1978.
Y. Tohkura et. al., "Spectral Smoothing Technique in PARCOR Speech Analysis-Synthesis," IEEE Trans. Acoust., Speech, Signal Processing, ASSP-26:587-596, Dec. 1978.
J.H. Chen, "A Robust Low-Delay CELP Speech Coder at 16 kbit/s, " Proc. IEEE Global Comm. Conf., pp. 1237-1241, Dallas, TX, Nov. 1989.
F.K. Soong et. al., "Line Spectrum Pair (LSP) and Speech Data Compression," Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, pp. 1.10.1-1.10.4, Mar. 1984.
K.K. Paliwal et. al., "Efficient Vector Quantization of LPC Parameters at 24 bits/frame," Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, pp. 661-664, Toronto, Canada, May 1991.
N. Jayant et. al., "Signal Compression Based on Models of Human Perception," Proc. IEEE, pp. 1385-1422, Oct. 1993.
J.V. Tobias, ed., Foundations of Modern Auditory Theory, Academic Press, New York and London, 1970.
M.R. Schroeder et. al., "Optimizing Digital Speech Coders by Exploiting Masking Properties of the Human Ear," J. Acoust. Soc. Amer., 66:1647-1652, Dec. 1979.
Y. Mahieux et al., "High-Quality Audio Transform Coding At 64 kbps," IEEE Transactions on Comm., 42 (1994) Nov., No. 11, New York, US, pp. 3010-3019.
M. R. Schroeder, "Optimizing digital speech coders by exploiting masking properties of the human ear," J. Acust. Soc. Am. 66(6), Dec. 1979, pp. 1647-1652.
N. S. Jayant et al. "Signal Compression Based On Models of Human Perception," Proc. of IEEE, vol. 81, No. 10, Oct. 1993.
Azirani et al, Optimizing speech enchancement by exploiting masking properties of the human ear, IEEE trans ASSP, pp. 800-803, May 1995.
Brown Kenneth M.
Hudspeth David R.
Lucent Technologies - Inc.
Restaino Thomas A.
Sax Robert Louis
LandOfFree
Synthesis of speech signals in the absence of coded parameters does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Synthesis of speech signals in the absence of coded parameters, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Synthesis of speech signals in the absence of coded parameters will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-1469524