Scalable compression of audio and other signals
Reexamination Certificate
2005-09-20
Knepper, David D. (Department: 2654)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Psychoacoustic
active
06947886
ABSTRACT:
Disclosed are scalable quantizers for audio and other signals characterized by a non-uniform, perception-based distortion metric, that operate in a common companded domain which includes both the base layer and one or more enhancement layers. The common companded domain is designed to permit use of the same unweighted MSE metric for optimal quantization parameter selection in multiple layers, exploiting the statistical dependence of the enhancement-layer signal on the quantization parameters used in the preceding layer. One embodiment features an asymptotically optimal entropy-coded uniform scalar quantizer. Another embodiment is an improved bit-rate-scalable multi-layer Advanced Audio Coder (AAC) which extends the scalability of the asymptotically optimal entropy-coded uniform scalar quantizer to systems with non-uniform base-layer quantization, selecting the enhancement-layer quantization methodology to be used in a particular band based on the preceding layer's quantization coefficients. In the important case that the source is well modeled as Laplacian, the optimal conditional quantizer is implementable by only two distinct switchable quantizers, depending on whether or not the previous quantizer identified the band in question as a so-called “zero dead-zone.” Hence, major savings in bit rate are recouped at virtually no additional computational cost. For example, the proposed four-layer scalable coder consisting of 16 kbps layers achieves performance close to a 60 kbps non-scalable coder on the standard test database of 44.1 kHz audio.
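The switching idea in the abstract can be illustrated with a small two-layer sketch. This is not the patented method, only a toy under stated assumptions: a power-law compander with exponent 0.75, a dead-zone base quantizer with a hypothetical rounding offset of 0.4, and a hypothetical refinement factor `n` for the enhancement layer. All function names are invented for illustration. The enhancement layer picks one of two quantizers depending on whether the base-layer index is zero.

```python
import math

def compress(x, power=0.75):
    """Map a sample into the companded domain (assumed power-law compander)."""
    return math.copysign(abs(x) ** power, x)

def expand(y, power=0.75):
    """Inverse of compress."""
    return math.copysign(abs(y) ** (1.0 / power), y)

def deadzone_quantize(y, step, offset):
    """Dead-zone quantizer: an offset below 0.5 widens the zero bin."""
    return int(math.copysign(math.floor(abs(y) / step + offset), y))

def encode(x, step, n, offset):
    """Return (base index k, enhancement index j) for one sample."""
    y = compress(x)
    k = deadzone_quantize(y, step, offset)
    fine = step / n
    if k == 0:
        # Base layer landed in the dead zone: reuse a dead-zone
        # quantizer with the finer step.
        j = deadzone_quantize(y, fine, offset)
    else:
        # Base layer was nonzero: split the base cell, which has width
        # `step`, into n equal cells with no dead zone.
        r = abs(y) - abs(k) * step + offset * step  # position inside the cell
        j = min(max(math.floor(r / fine), 0), n - 1)
    return k, j

def decode_companded(k, j, step, n, offset):
    """Two-layer reconstruction in the companded domain."""
    fine = step / n
    if k == 0:
        return j * fine
    # Midpoint of the selected sub-cell, mirrored for negative indices.
    mag = abs(k) * step - offset * step + (j + 0.5) * fine
    return math.copysign(mag, k)

def decode(k, j, step, n, offset):
    """Two-layer reconstruction mapped back to the signal domain."""
    return expand(decode_companded(k, j, step, n, offset))
```

Because both layers measure error in the same companded domain, the enhancement-layer error in this sketch is bounded by `(1 - offset) * step / n` regardless of which of the two quantizers was selected, and the decoder needs no side information beyond the base index to know which one was used.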
REFERENCES:
patent: 5612900 (1997-03-01), Azadegan et al.
patent: 5734679 (1998-03-01), Nomura
patent: 5774844 (1998-06-01), Akagiri
patent: 6009387 (1999-12-01), Ramaswamy et al.
patent: 6029126 (2000-02-01), Malvar
patent: 6098039 (2000-08-01), Nishida
patent: 6108626 (2000-08-01), Cellario et al.
patent: 6349284 (2002-02-01), Park et al.
patent: 2003/0058931 (2003-03-01), Zhang et al.
patent: 2003/0212551 (2003-11-01), Rose et al.
“Towards Weighted Mean-Squared Error Optimality of Scalable Audio Coding”, a dissertation submitted in partial satisfaction of requirements for the degree Doctor of Philosophy in Electrical and Computer Engineering by Ashish Aggarwal, dated Dec. 2002.
Article in Audio Engineering Society entitled “Compander Domain Approach to Scalable AAC”, by Ashish Aggarwal, Shankar Regunathan and Kenneth Rose, University of California, presented at the 110th Convention, May 12-15, 2001, Amsterdam.
Article “A Conditional Enhancement-Layer Quantizer For the Advanced Audio Coder”, by Ashish Aggarwal and Kenneth Rose, ICASSP 2002.
Article entitled “Asymptotically Optimal Scalable Coding for Minimum Weighted Mean Square Error”, by Ashish Aggarwal, Shankar Regunathan and Kenneth Rose, Data Compression Conference, Nov. 15, 2000.
Aggarwal Ashish
Regunathan Shankar L.
Rose Kenneth
Fulbright & Jaworski
Knepper David D.
The Regents of the University of California