System and method for efficient time-domain aliasing...

Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission

Reexamination Certificate

Rate now

[ 0.00 ] – not rated yet Voters 0 Comments 0

Details System and method for efficient time-domain aliasing... System and method for efficient time-domain aliasing...

: 1999-02-26
: 2002-08-06
: Chawan, Vijay B (Department: 2654)
: Data processing: speech signal processing, linguistics, language
: Speech signal processing
: For storage or transmission

: C704S200100, C704S204000, C704S205000, C704S500000, C375S240000, C708S402000, C708S401000, C708S400000
: Reexamination Certificate
: active
: 06430529
: ABSTRACT:

BACKGROUND OF THE INVENTION
1. Field of the Invention
This invention relates generally to improvements in digital audio processing, and relates specifically to a system and method for implementing an efficient time-domain aliasing cancellation in digital audio encoding.
2. Description of the Background Art
Digital audio is now in widespread use in digital video disk (DVD) players, digital satellite systems (DSS), and digital television (DTV). A problem in all of these systems is the limitation of either storage capacity or bandwidth, which may be viewed as two aspects of a common problem. In order to fit more digital audio in a storage device of limited storage capacity, or to transmit digital audio over a channel of limited bandwidth, some form of digital audio compression is required. One commonly used form of compression is perceptual encoding, where models based upon human hearing allow for removing information corresponding to sounds that will not be perceived by a human.
The Advanced Television Systems Committee (ATSC) selected the Dolby® Labs design for perceptual encoding for use in the Digital Television (DTV) system (formerly known as HDTV). This design is set forth in the Audio Compression version 3 (AC-3) specification ATSC A/52 (hereinafter “the AC-3 specification”), which is hereby incorporated by reference. The AC-3 specification has been subsequently selected for Region 1 (North American market) DVD and DSS broadcast.
The AC-3 specification gives a standard decoder design for digital audio, which allows all AC-3 encoded digital audio recordings to be reproduced by differing vendors' equipment. In contrast, the specifics of the AC-3 audio encoding process are not normative requirements of the AC-3 standard. Nevertheless, the encoder must produce a bitstream matching the syntax in the standard, which, when decoded, produces audio of sufficient quality for the intended application. Therefore, many of the encoder design details may be left to the individual designer without affecting the ability of the resulting encoded digital audio to be reproduced with the standard decoder design. It is usually more efficient to compress the audio data in the frequency domain rather than in the time domain. One way to perform the conversion from time domain to frequency domain is the modified discrete cosine transform (MDCT), which is one form of a discrete Fourier transform acting upon a function of a discrete variable. The MDCT is often used to convert input data sequences of discrete variables called time-domain data samples into output data sequences of discrete variables called frequency-domain coefficients. The time-domain data samples represent the measured values of the incoming audio data at discrete time values, and the frequency-domain coefficients represent the corresponding signal strengths at discrete frequency values.
In order to achieve high-fidelity audio when the encoded signals are later decoded during playback, the AC-3 specification adopted a method called time-domain aliasing cancellation (TDAC). The TDAC method may allow the near-perfect reconstruction of the original audio when encoded audio data is subsequently decoded for playback. The TDAC method includes two processes: a properly-chosen windowing operation using multiplication by windowing coefficients, followed by a MDCT.
An important design decision in a perceptual encoding standard is the number of digital samples transformed at a time in an MDCT, called the block-length of the MDCT. When transients (rapid fluctuations in values in a sequence of time-domain samples) are not observed, block switch flag blksw is set equal to 0, and an AC-3 encoder designed for TDAC switches to long-block MDCT calculations of 512 samples. When transients are observed, block switch flag blksw is set equal to 1, and the encoder switches to pairs of short-block MDCT calculations of 256 samples. A longer block-length increases frequency resolution but lowers time resolution. A longer block transform is usually adopted when the signal is relatively stable. A shorter block transform is adopted when the signal is relatively unstable to prevent pre-echoing effects. Therefore, rather than select a single MDCT block-length, an encoder designed for TDAC switches between MDCT block-lengths of 512 samples and 256 samples in order to maximize fidelity as audio circumstances require.
The AC-3 specification gives a basic equation for the calculation of the encoder MDCT. However, directly calculating the MDCT using the basic equation requires inordinate amounts of processor power, which prevents the implementation of an encoder with practical, cost-effective processing components. Optimizing the calculations for the MDCT for the different block-lengths is therefore an issue in the efficient design of AC-3 encoders.
SUMMARY OF THE INVENTION
The present invention includes a system and method for an efficient time-domain aliasing cancellation (TDAC) in digital audio encoding. In one embodiment, the present invention comprises an improved modified discrete cosine transform (MDCT) method for efficient perceptive encoding compression of digital audio in Dolby® Digital AC-3 format. In alternate embodiments, the improved MDCT method may be used in other perceptive encoding formats.
One embodiment of the present invention utilizes complex-valued premultiplication and complex-valued postmultiplication steps which prepare and arrange the data samples so that both the long-block and short-block transforms may be efficiently performed. The premultiplication and postmultiplication steps are carefully structured to work with discrete Fourier transforms (DFT) in a manner which will give the same numeric results as would be achieved with a direct calculation of the MDCT. However, the complex-valued premultiplication, DFT, and complex-valued postmultiplication steps together require many fewer calculation steps than the direct calculation of the MDCT. In this manner, the present invention facilitates the use of consumer-oriented digital signal processors (DSP) of reduced computational power, which in turn reduces the cost for practical implementations.

REFERENCES:
patent: 5230038 (1993-07-01), Fielder et al.
patent: 5297236 (1994-03-01), Antill et al.
patent: 5363096 (1994-11-01), Duhamel et al.
patent: 5727119 (1998-03-01), Davidson et al.
patent: 5781888 (1998-07-01), Herre
patent: 5857000 (1999-01-01), Jar-Ferr et al.
patent: 5890106 (1999-03-01), Bosi-Goldberg
patent: 6119080 (2000-09-01), Liu et al.
patent: 6119038 (2001-03-01), Tsutsui
patent: 6209015 (2001-03-01), Jhung
patent: 9222137 (1992-10-01), None
Lau et al., (“A common transform engine for MPEG and AC3 audio decoder”, IEEE transactions on Consumer Electronics, Jun. 1997, vol. 43, Issue 3, pp. 559-566).*
Vetterli et al., (“Split-radix algorithms for length-p/sup m/DFTs”, 1988 International Conference on Acoustics, Speech, and Signal Processing, 1988, ICASSP-88, vol. 3, Apr. 1988, pp. 1415-1418).*
Duhamel(“Algorithms meeting the lower bounds on the multiplicative complexity of Length-2n DFT's and their connection with practical algorithms”, Transactions ICASSP-90, Sep. 1990, vol. 38, Issue 9, pp. 1504-1514).*
Jhung et al., (“Architecture of Dual mode audio filter for AC-3 and MPEG”, IEEE Transactions on Consumer Electronics, vol. 43, Issue 3, Aug. 1997, pp. 575-585).*
Szu-Wei et al. “Transformation from 512-point transform coefficients to 256-point transform coefficients for Dolby AC-3 decoder” vol. 35, No. 19, 16/9/99, pp. 1614-1615.
Todd, Craig C. et al., “AC-3: Flexible Perceptual Coding for Audio Transmission and Storage,” presented at Audio Engineering Society Convention, Amsterdam, Feb. 26—Mar. 1, 1994, pp. 1-16.
Advanced Television Systems Committee, “Digital Audio Compression Standard (AC-3),” ATSC Document A/52, Dec. 20, 1995, pp. 1-130.

Affiliated with

Huang Shay-Jan

Inventor

[ 0.00 ] – not rated yet Voters 0 Comments 0

Also associated with

Chawan Vijay B

Examiner

[ 0.00 ] – not rated yet Voters 0 Comments 0

Koerner Gregory J.

Attorney

[ 0.00 ] – not rated yet Voters 0 Comments 0

Simon & Koerner LLP

Law Firm

[ 0.00 ] – not rated yet Voters 0 Comments 0

Sony Corporation

Corporate Assignee

[ 0.00 ] – not rated yet Voters 0 Comments 0

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

System and method for efficient time-domain aliasing... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with System and method for efficient time-domain aliasing..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and System and method for efficient time-domain aliasing... will most certainly appreciate the feedback.

Rate now

Comments { 0 }

Profile ID: LFUS-PAI-O-2949931

All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.

Canada

Charities
Companies
MP Candidates
Patents
Employee Salary Disclosure

World

Places of the World
Scientific Papers

United States

Banks
Companies
Counties
Patents
Employee Salary Disclosure