Patent
1994-01-28
1997-04-22
MacDonald, Allen R.
395 23, 395 235, 395 239, 395 237, G10L 900
Patent
active
056235771
ABSTRACT:
The invention relates in general to low bit-rate encoding and decoding of information such as audio information. More particularly, the invention relates to computationally efficient adaptive bit allocation and quantization of encoded information useful in high-quality low bit-rate coding systems.
In audio applications, a digital split-band encoder splits an input signal into frequency subband signals having bandwidths commensurate with the critical bandwidths of the human auditory system, quantizes the subband signals according to values established by an allocation function, and assembles the quantized subband signals into an encoded signal. The allocation function establishes allocation values in accordance with psychoacoustic principles with allowance for decoding synthesis filter bank spectral distortions.
In one embodiment, an allocation function establishes allocation values using a psychoacoustic masking threshold generated by estimating the power spectral density (PSD) of the input signal, generating an excitation pattern by applying a basilar-membrane spreading function to the PSD, adjusting the excitation pattern by an amount equal to a sensitivity function which specifies a signal-to-noise ratio (SNR) sufficient to achieve psychoacoustic masking, comparing the level of the adjusted pattern to the threshold of hearing and generating the psychoacoustic masking threshold which is equal to the larger of the two. An allocation function may allow for decoder synthesis filter bank spectral distortions in any of a number of ways such as by adapting the sensitivity function.
REFERENCES:
patent: 5301255 (1994-04-01), Nagai et al.
patent: 5475789 (1995-12-01), Nishiguchi
Thomas W. Parsons, Voice and Speech Processing, McGraw-Hill, New York, NY, p. 9 1987.
Harris; "On the Use of Windows for Harmonic Analysis with the Discrete Fourier Transform," Proc. of IEEE, vol. 66, Jan. 1978, pp. 51-58, 81-83.
Schroeder, et al.; "Optimizing Digital Speech Coders by Exploiting Masking Properties of the Human Ear," J. Acoust. Soc. Am., Dec. 1979, pp. 1647-1652.
Johnston; "Transform Coding of Audio Signals Using Perceptual Noise Criteria," IEEE J. on Selected Areas in Comm., Feb. 1988, pp/ 314-323.
Mahieux, et al.; "Transform Coding of Audio Signals at 64 kBit/s," Proc. IEEE GLOBECOM, Dec. 1990, vol. 2, pp. 518-522.
"Coding of Moving Pictures and Associated Audio for Digital Storage Media at Up to About 1.5 Mbit/s," CD 11172-3, ISO/IEC JTCI/SC29, 1992, pp. 2, D-19 to D-24.
Kuusama, et al.; "Capacity and Properties of Slave Mode Hidden Channel Coding", IEEE Int. Conf. Sys. Engr., Sep. 1992, pp. 467-472.
Dolby Laboratories Licensing Corporation
Gallagher Thomas A.
Lathrop David N.
MacDonald Allen R.
Sartori Michael A.
LandOfFree
Computationally efficient adaptive bit allocation for encoding m does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Computationally efficient adaptive bit allocation for encoding m, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Computationally efficient adaptive bit allocation for encoding m will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-347619