Data processing: speech signal processing – linguistics – language – Audio signal bandwidth compression or expansion
Reexamination Certificate
2006-09-19
2006-09-19
Dorvil, Richemond (Department: 2654)
Data processing: speech signal processing, linguistics, language
Audio signal bandwidth compression or expansion
C707S793000, C707S793000
Reexamination Certificate
active
07110953
ABSTRACT:
A perceptual audio coder is disclosed for encoding audio signals, such as speech or music, with different spectral and temporal resolutions for redundancy reduction and irrelevancy reduction. The disclosed perceptual audio coder separates the psychoacoustic model (irrelevancy reduction) from the redundancy reduction, to the extent possible. The audio signal is initially spectrally shaped using a prefilter controlled by a psychoacoustic model. The prefilter output samples are thereafter quantized and coded to minimize the mean square error (MSE) across the spectrum. The disclosed perceptual audio coder can use fixed quantizer step-sizes, since spectral shaping is performed by the pre-filter prior to quantization and coding. The disclosed pre-filter and post-filter support the appropriate frequency dependent temporal and spectral resolution for irrelevancy reduction. A filter structure based on a frequency-warping technique is used that allows filter design based on a non-linear frequency scale. The characteristics of the pre-filter may be adapted to the masked thresholds (as generated by the psychoacoustic model), using techniques known from speech coding, where linear-predictive coefficient (LPC) filter parameters are used to model the spectral envelope of the speech signal. Likewise, the filter coefficients may be efficiently transmitted to the decoder for use by the post-filter using well-established techniques from speech coding, such as an LSP (line spectral pairs) representation, temporal interpolation, or vector quantization.
REFERENCES:
patent: 5481614 (1996-01-01), Johnston
patent: 5535300 (1996-07-01), Hall et al.
patent: 5627938 (1997-05-01), Johnston
patent: 5687191 (1997-11-01), Lee et al.
patent: 5699484 (1997-12-01), Davis
patent: 5774844 (1998-06-01), Akagiri
patent: 5950156 (1999-09-01), Ueno et al.
patent: 5956674 (1999-09-01), Smyth et al.
patent: 2001/0047256 (2001-11-01), Tsurushima et al.
Srinivasan et al. high-quality audio compression using an adaptive wavelet packet decomposition and psychoacoustic model IEEE Transaction on signal processing, vol. 46, Apr. 1998, pp. 1085-1093.
Smith, “the scientist and engineer's guide to digital signal processing”, ISBN 0-9660176-33, 1997,p. 297-310.
Chang et al., “A Masking-Threshold-Adapted Weighting Filter for Excitation Search,” IEEE Transactions on Speech and Audio Processing, vol. 4, No. 2, 124-132 (Mar. 1996).
Soong et al., “Line Spectrum Pair (LSP) and Speech Data Compression,” in Proc. IEEE International Conference on Acoustics, Speech, Signal Processing, pp. 1.10.1-1.10.4 (Mar. 1984).
Lefebvre et al., “Spectral Amplitude Warping (SAW) for Noise Spectrum Shaping in Audio Coding,” IEEE International Conference on Acoustics, Speech, and Signal Processing, Germany, 335-338 (Apr. 1997).
Edler et al., “Audio Coding Using a Psychoacousti Pre- And Post-Filter,” Proc. IEEE Int. Conf. Acoustics, Speech and Signal Processing, vol. II, 6-9, pp. 881-884 (Jun. 2000).
Sinha et al., “The Perceptual Audio Coder (PAC),” The Digital Signal Processing Handbook; Madisetti V.K., Douglas, B.W. (Eds); CRC Press, IEEE Press, pp. 42-1-42-18 (1998).
Edler Bernd Andreas
Schuller Gerald Dietrich
Agere Systems Inc.
Dorvil Richemond
Han Qi
LandOfFree
Perceptual coding of audio signals using separated... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Perceptual coding of audio signals using separated..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Perceptual coding of audio signals using separated... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3571581