Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission
Patent
1997-10-28
1999-12-21
Hudspeth, David R.
Data processing: speech signal processing, linguistics, language
Speech signal processing
For storage or transmission
704205, 704229, G10L 704
Patent
active
060061790
ABSTRACT:
An audio coder/decoder ("codec") that is suitable for real-time applications due to reduced computational complexity, and a novel adaptive sparse vector quantization (ASVQ) scheme and algorithms for general purpose data quantization. The codec provides low bit-rate compression for music and speech, while being applicable to higher bit-rate audio compression. The codec includes an in-path implementation of psychoacoustic spectral masking, and frequency domain quantization using the novel ASVQ scheme and algorithms specific to audio compression. More particularly, the inventive audio codec employs frequency domain quantization with critically sampled subband filter banks to maintain time domain continuity across frame boundaries. The input audio signal is transformed into the frequency domain in which in-path spectral masking can be directly applied. This in-path spectral masking usually results in sparse vectors. The ASVQ scheme is a vector quantization algorithm that is particularly effective for quantizing sparse signal vectors. In the preferred embodiment, ASVQ adaptively classifies signal vectors into six different types of sparse vector quantization, and performs quantization accordingly. The ASVQ technique applies to general purpose data quantization as well as to quantization in the context of audio compression. The invention also includes a "soft clipping" algorithm in the decoder as a post-processing stage. The soft clipping algorithm preserves the waveform shapes of the reconstructed time domain audio signal in a frame- or block-oriented stateless manner while maintaining continuity across frame or block boundaries. The invention includes related methods, apparatus, and computer programs.
REFERENCES:
patent: 4811398 (1989-03-01), Copperi et al.
patent: 4868867 (1989-09-01), Davidson et al.
patent: 5371544 (1994-12-01), Jacquin et al.
patent: 5388181 (1995-02-01), Anderson et al.
patent: 5596676 (1997-01-01), Swaminathan et al.
Pamela C. Cosman, Robert M. Gray, and Martin Vetterli, "Vector Quantization of Image Subbands: A Survey", IEEE Trans. on Image Processing, vol. 5, No. 2, pp. 202-225, Feb. 1996.
Collected Papers on Digital Audio Bit Rate Reduction, Neil Gilchrist and Christer Grewin, Eds., Audio ES, Jun. 1996.
Mantegna John
Wu Shuwu
America Online Inc.
Hudspeth David R.
Smits Talivaldis Ivars
LandOfFree
Audio codec using adaptive sparse vector quantization with subba does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Audio codec using adaptive sparse vector quantization with subba, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Audio codec using adaptive sparse vector quantization with subba will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-515577