Method for independent dynamic range control

Data processing: speech signal processing – linguistics – language – Audio signal bandwidth compression or expansion

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C704S501000, C704S224000, C704S230000

Reexamination Certificate

active

06782366

ABSTRACT:

CROSS-REFERENCE TO SOURCE CODE APPENDIX
Appendix A, which is part of the present disclosure, contains assembly code for a digital signal processor for implementing one embodiment of this invention as described more completely below.
A portion of the present disclosure of this patent document contains material that is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure, as it appears in the Patent and Trademark Office patent files or records, but otherwise reserves all copyright rights whatsoever.
BACKGROUND
Active Coding-3, also known as “AC-3” or “Dolby Digital,” is a digital audio standard described in “Digital Audio Compression (AC-3)” by the United States Advanced Television Systems Committee, which is hereby incorporated by reference in its entirety.
A prior art AC-3 bitstream is illustrated in
FIG. 1A
, which is reproduced from FIG. 5.1 of “Digital Audio Compression (AC-3)” referenced above. The AC-3 bitstream is made up of a sequence of synchronization frames. Each synchronization frame contains 6 coded audio blocks (“AB”), each of which represents 256 new audio samples. A synchronization information (“SI”) header at the beginning of each frame contains information needed to acquire and maintain synchronization. A bitstream information (“BSI”) header follows the SI header, and contains parameters describing the coded audio service. An auxiliary data (“AUX”) field may follow the coded audio blocks. At the end of each frame is an error check field that includes a cyclical redundancy check (“CRC”) word for error detection. An optional CRC word is also located in the SI header.
AC-3 provides a dynamic range control system that allows program providers, such as movie studios, to control the dynamic range of their audio programs. Dynamic range refers to the range of the relative sound levels in an audio program. For example, dialogues are usually used as a reference where loud sounds are certain decibels above the dialogue sound level while soft sounds are certain decibels below the dialogue sound level.
Program providers can encode dynamic range gain words (“dynrng”) (e.g., 8 bits) in the audio blocks to alter the gain of the audio blocks. The dynrng values typically indicate decibel (“dB”) gain reduction during the loudest signal passages, and dB gain increases during the quite passages. AC-3 provides that AC-3 decoders shall implement the compression characteristics indicated by the dynrng values encoded in the audio blocks. AC-3 further provides that AC-3 decoders may optionally allow listener control over the use of the dynrng values so that the listener may select full dynamic range reproduction by ignoring the dynrng values or partial dynamic range reproduction by using some fraction of the dynrng values.
Program providers can also encode compression gain words (“compr”) (e.g., 8 bits) in the BI header to alter the gain of the audio frames. The compr values provide larger dynamic range reductions (also known as “heavy compression”) than the dynrng values. The compr values have twice the control range as the dynrng values (±48 dB vs. ±24 dB) with half the resolution (0.5 dB v. 0.25 dB).
AC-3 decoders may provide both 2-channel and m-channel outputs (m>0; e.g., m=6). In some applications, consumers may desire independent dynamic range control for the 2-channel and the m-channel outputs. To provide independent dynamic range control for the 2-channel and the m-channel outputs, AC-3 decoders can (1) execute the decoding algorithm with one set of dynamic range gain words (or compression gain words) for the 2-channel output, and (2) execute the decoding algorithm again with another set of dynamic range gain words (or compression gain words) for the m-channel output. This method is inefficient because actions that are computational and/or memory intensive are repeated. Thus, what is needed is a method that provides independent dynamic range control for 2-channel and m-channel outputs while minimizing the repetition of computational and/or memory intensive actions.
SUMMARY
In accordance with one aspect of the invention, independent dynamic range control are provided for 2-channel and m-channel outputs without repeating computational and/or memory intensive actions including the inverse transform and the windowing of audio samples.
In one embodiment, m-channel dynamic range control is conventionally applied to the m-channel audio samples in the frequency domain to form m-channel frequency samples. The m-channel frequency samples are inverse transformed to generate audio samples in the time domain (“m-channel time samples”) and windowed to generate windowed time samples (i.e., the m-channel output).
2-channel dynamic range control is applied to the m-channel audio samples in the time domain after windowing instead of the m-channel audio samples in the frequency domain prior to the inverse transform, thereby avoiding repeating the inverse transform and the windowing of the m-channel audio samples. To do this, the m-channel output is divided into groups and each group is multiplied with a corresponding 2-channel dynamic range scale factor. The 2-channel dynamic range scale factors at least partially remove the effects of the m-channel dynamic range control applied in the frequency domain and the windowing in the time domain, and readjust the dynamic range of the m-channel output for 2-channel output. These audio samples are then downmixed to form the 2-channel output.
In another embodiment, the 2-channel and m-channel outputs are generated without repeating the inverse transform. The m-channel dynamic range control is conventionally applied to the m-channel frequency samples. The m-channel frequency samples are inverse transformed to generate the m-channel time samples.
The m-channel time samples are duplicated to two sets. In the first set, the m-channel time samples are conventionally windowed to generate windowed time samples (i.e., the m-channel output). In the second set, a 2-channel dynamic range final scale is multiplied with the m-channel time samples. The 2-channel final scale at least partially removes the effects of the m-channel dynamic range control applied in the frequency domain and readjusts dynamic range of the m-channel time samples for 2-channel output. The second set is then windowed to generate windowed time samples and downmixed to form the 2-channel output.


REFERENCES:
patent: 5633981 (1997-05-01), Davis
patent: 6351733 (2002-02-01), Saunders et al.
patent: 6487535 (2002-11-01), Smyth et al.
patent: 6516299 (2003-02-01), Case
Digital Audio Compression Standard (AC-3),Advanced Television Systems Committee(1995).
Chapter 6 Instruction Set,Functional Specification Copyright© by LSI Logic Corporation(1997).
Digital Audio Compression Standard (AC-3), Advanced Television Systems Committe (1995).

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Method for independent dynamic range control does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Method for independent dynamic range control, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method for independent dynamic range control will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3356727

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.