Patent
1995-09-19
1998-01-20
MacDonald, Allen R.
395 226, 395 238, G10L 914
Patent
active
057108633
ABSTRACT:
A speech compression system called "Transform Predictive Coding", or TPC, provides for encoding 7 kHz wideband speech (160 kHz sampling) at a target bit-rate range of 16 to 32 kb/s (1 to 2 bits/sample). The system uses short-term and long-term prediction to remove the redundancy in speech. A prediction residual is transformed and coded in the frequency domain to take advantage of knowledge in human auditory perception. The TPC coder uses only open-loop quantization and therefore has a fairly low complexity. The speech quality of TPC is essentially transparent at 32 kb/s, very good at 24 kb/s, and acceptable at 16 kb/s.
REFERENCES:
patent: Re32580 (1988-01-01), Atal et al.
patent: 4811396 (1989-03-01), Yatsuzuka
patent: 4896362 (1990-01-01), Veldhuis et al.
patent: 4969192 (1990-11-01), Chen et al.
patent: 5314457 (1994-05-01), Jeutter et al.
patent: 5327520 (1994-07-01), Chen
patent: 5533052 (1996-07-01), Bhaskar
W.W. Chang et.al., "Audio Coding Using Masking-Threshold Adapted Perceptual Filter," Proc. IEEE Workshop Speech Coding for Telecomm., pp. 9-10, Oct. 1993.
L.R. Rabiner et.al., Digital Processing of Speech Signals, Prentice-Hall, Inc., Englewood Cliffs, NJ, 1978.
Y. Tohkura et.al., "Spectral Smoothing Technique in PARCOR Speech Analysis-Synthesis," IEEE Trans. Acoust., Speech, Signal Processing, ASSP-26:587-596, Dec. 1978.
J.H. Chen, "A Robust Low-Delay CELP Speech Coder at 16kbits/," Proc. IEEE Global Comm. Conf., pp. 1237-1241, Dallas, TX, Nov. 1989.
F.K. Soong et.al., "Line Spectrum Pair (LSP) and Speech Data Compression," Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, pp. 1.10.1-1.10.4, March 1984.
K.K. Paliwal et.al., "Efficient Vector Quantization of LPC Parameters at 24 bits/frame," Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, pp. 661-664, Toronto, Canada, May 1991.
N. Jayant et.al., "Signal Compression Based on Models of Human Perception," Proc. IEEE, pp. 1385-1422, Oct. 1993.
J.V. Tobias ed., Foundations of Modern Auditory Theory, Academic Press, New York and London, 1970.
M.R. Schroeder et.al., "Optimizing Digital Speech Coders by Exploiting Masking Properties of the Human Ear," J. Acoust. Soc. Amer., 66:1647-1652, Dec. 1979.
Brown Kenneth M.
Dorvil Richemond
MacDonald Allen R.
Restaino Thomas A.
LandOfFree
Speech signal quantization using human auditory models in predic does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Speech signal quantization using human auditory models in predic, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Speech signal quantization using human auditory models in predic will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-731816