Data processing: speech signal processing – linguistics – language – Audio signal bandwidth compression or expansion
Reexamination Certificate
1997-09-25
2002-10-15
Tsang, Fan (Department: 2645)
Data processing: speech signal processing, linguistics, language
Audio signal bandwidth compression or expansion
C704S226000
Reexamination Certificate
active
06466912
ABSTRACT:
FIELD OF THE INVENTION
This invention relates to perceptually-based coding of audio signals, such as monophonic, stereophonic, or multichannel audio signals, speech, music, or other material intended to be perceived by the human ear.
BACKGROUND OF THE INVENTION
Demands in the commercial market for increased quality in the reproduction of audio signals have led to investigations of digital techniques which promise the possibility of preserving much of the original signal quality. However, a straight-forward application of conventional digital coding would lead to excessive data rates; so acceptable techniques of data compression are needed.
One signal compression technique, referred to as perceptual coding, employs the idea of distortion or noise masking in which the distortion or noise is masked by the input signal. The masking occurs because of the inability of the human perceptual mechanism to distinguish two signal components (one belonging to the signal and one belonging to the noise) in the same spectral, temporal, or spatial locality under some conditions. An important effect of this limitation is that the perceptibility (or loudness) of noise (e.g., quantizing noise) can be zero even if the objectively measured local signal-to-noise ratio is low. Additional details concerning perceptual coding techniques may be found in N. Jayant et al., “Signal Compression Based on Models of Human Perception,” Proceedings of the IEEE, Vol. 81, No. 10, October 1993.
U.S. Pat. No. 5,341,457 discloses a perceptual coding technique in which a perceptual audio encoder is used to convert the audio signal (or a function thereof) into a measure of predictability (e.g., a spectral flatness measure) and then into a tonality metric from which a noise to mask ratio can be calculated, using knowledge provided by controlled subjective testing of the masking properties of tones and noise. Other techniques calculate the tonality metric from a loudness or loudness uncertainty calculation. These known perceptual coding techniques are either computationally inefficient, provide incorrect noise to mask ratios for some kinds of audio signal, or both.
Accordingly, it is desirable to provide a perceptual coding technique that reduces the complexity of the required computations while increasing the accuracy of the resulting noise to mask ratios.
SUMMARY OF THE INVENTION
The inventor has determined that accurate perceptual coding does not require a measure of tonality. Rather, perceptual coding is accomplished by measuring the envelope roughness of the filtered audio signal, which may be directly converted to the noise to mask threshold needed to calculate the perceptual threshold or “just noticeable difference”. Thus, the present invention does not require any complex calculations to determine tonality, either by a measure of predictability or by the calculation of a loudness or loudness uncertainty. Instead, the envelope roughness of the signal is simply reduced directly to the noise to mask ratio.
REFERENCES:
patent: 4896362 (1990-01-01), Veldhuis et al.
patent: 5105463 (1992-04-01), Veldhuis et al.
patent: 5136377 (1992-08-01), Johnston et al.
patent: 5161210 (1992-11-01), Druyvesteyn et al.
patent: 5471558 (1995-11-01), Tsutsui
patent: 5550924 (1996-08-01), Helf et al.
patent: 5553193 (1996-09-01), Akagiri
patent: 5583967 (1996-12-01), Akagiri
patent: 5682463 (1997-10-01), Allen et al.
patent: 5684920 (1997-11-01), Iwakami et al.
patent: 5699479 (1997-12-01), Allen et al.
patent: 5864820 (1999-01-01), Case
patent: 5890125 (1999-03-01), Davis et al.
patent: 5911128 (1999-06-01), DeJaco
“Information technology—Coding of moving pictures and associated audio for digital storage media at up to about 1,5 Mbit/s-”, Part 3: Audio, ISO/IEC 11172-3, 1993.
AT&T Corp.
Opsasnick Michael N.
Tsang Fan
LandOfFree
Perceptual coding of audio signals employing envelope... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Perceptual coding of audio signals employing envelope..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Perceptual coding of audio signals employing envelope... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2990246