Audio coding method and apparatus

Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission

Reexamination Certificate

Rate now

[ 0.00 ] – not rated yet Voters 0 Comments 0

Details Audio coding method and apparatus Audio coding method and apparatus

: 1998-03-06
: 2004-04-13
: Abebe, Daniel (Department: 2655)
: Data processing: speech signal processing, linguistics, language
: Speech signal processing
: For storage or transmission

: C704S230000
: Reexamination Certificate
: active
: 06721700
: ABSTRACT:

FIELD OF THE INVENTION
The present invention relates to a method and apparatus for audio coding and to a method and apparatus for audio decoding.
BACKGROUND OF THE INVENTION
It is well known that the transmission of data in digital form provides for increased signal to noise ratios and increased information capacity along the transmission channel. There is however a continuing desire to further increase channel capacity by compressing digital signals to an ever greater extent. In relation to audio signals, two basic compression principles are conventionally applied. The first of these involves removing the statistical or deterministic redundancies in the source signal whilst the second involves suppressing or eliminating from the source signal elements with are redundant insofar as human perception is concerned. Recently, the latter principle has become predominant in high quality audio applications and typically involves the separation of an audio signal into its frequency components (sometimes called “sub-bands”), each of which is analysed and quantised with a quantisation accuracy determined to remove data irrelevancy (to the listener). The ISO (International Standards Organisation) MPEG (Moving Pictures Expert Group) audio coding standard and other audio coding standards employ and further define this principle. However, MPEG (and other standards) also employs a technique know as “adaptive prediction” to produce a further reduction in data rate.
The operation of an encoder according to the new MPEG-2 AAC standard is described in detail in the draft International standard document ISO/IEC DIS 13818-7. This new MPEG-2 standard employs backward linear prediction with 672 of 1024 frequency components. It is envisaged that the new MPEG-4 standard will have similar requirements. However, such a large number of frequency components results in a large computational overhead due to the complexity of the prediction algorithm and also requires the availability of large amounts of memory to store the calculated and intermediate coefficients. It is well known that when backward adaptive predictors of this type are used in the frequency domain, it is difficult to further reduce the computational loads and memory requirements. This is because the number of predictors is so large in the frequency domain that even a very simple adaptive algorithm still results in large computational complexity and memory requirements. Whilst it is known to avoid this problem by using forward adaptive predictors which are updated in the encoder and transmitted to the decoder, the use of forward adaptive predictors in the frequency domain inevitably results in a large amount of “side” information because the number of predictors is so large.
It is an object to the present invention to overcome or at least mitigate the disadvantages of known prediction methods.
This and other objects are achieved by coding an audio signal using error signals to remove redundancy in each of a plurality of frequency sub-bands of the audio signal and in addition generating long term prediction coefficients in the time domain which enable a current frame of the audio signal to be predicted from one or more previous frames.
SUMMARY OF THE INVENTION
According to a first aspect of the present invention there is provided a method of coding an audio signal, the method comprising the steps of:
receiving an audio signal x to be coded;
generating a quantised audio signal {tilde over (x)} from the received audio signal x;
generating a set of long-term prediction coefficients A which can be used to predict a current time frame of the received audio signal x directly from at least one previous time frame of the quantised audio signal {tilde over (x)};
using the prediction coefficients A to generate a predicted audio signal {circumflex over (x)};
comparing the received audio signal x with the predicted audio signal {circumflex over (x)} and generating an error signal E(k) for each of a plurality of frequency sub-bands;
quantising the error signals E(k) to generate a set of quantised error signals {tilde over (E)}(k); and
combining the quantised error signal {tilde over (E)}(k) and the prediction coefficients A to generate a coded audio signal.
The present invention provides for compression of an audio signal using a forward adaptive predictor in the time domain. For each time frame of a received signal, it is only necessary to generate and transmit a single set of forward adaptive prediction coefficients for transmission to the decoder. This is in contrast to known forward adaptive prediction techniques which require the generation of a set of prediction coefficients for each frequency sub-band of each time frame. In comparison to the prediction gains obtained by the present invention, the side information of the long term predictor is negligible.
Certain embodiments of the present invention enable a reduction in computational complexity and in memory requirements. In particular, in comparison to the use of backward adaptive prediction, there is no requirement to recalculate the prediction coefficients in the decoder. Certain embodiments of the invention are also able to respond more quickly to signal changes than conventional backward adaptive predictors.
In one embodiment of the invention, the received audio signal x is transformed in frames x
m
from the time domain to the frequency domain to provide a set of frequency sub-band signals X(k). The predicted audio signal {circumflex over (x)} is similarly transformed from the time domain to the frequency domain to generate a set of predicted frequency sub-band signals {circumflex over (X)}(k) and the comparison between the received audio signal x and the predicted audio signal {circumflex over (x)} is carried out in the frequency domain, comparing respective sub-band signals against each other to generate the frequency sub-band error signals E(k). The quantised audio signal {tilde over (x)} is generated by summing the predicted signal and the quantised error signal, either in the time domain or in the frequency domain.
In an alternative embodiment of the invention, the comparison between the received audio signal x and the predicted audio signal {circumflex over (x)} is carried out in the time domain to generate an error signal e also in the time domain. This error signal e is then converted from the time to the frequency domain to generate said plurality of frequency sub-band error signals E(k).
Preferably, the quantisation of the error signals is carried out according to a psycho-acoustic model.
According to a second aspect of the present invention there is provided a method of decoding a coded audio signal, the method comprising the steps of:
receiving a coded audio signal comprising a quantised error signal {tilde over (E)}(k) for each of a plurality of frequency sub-bands of the audio signal and, for each time frame of the audio signal, a set of prediction coefficients A which can be used to predict a current time frame x
m
of the received audio signal directly from at least one previous time frame of a reconstructed quantised audio signal {tilde over (x)};
generating said reconstructed quantised audio signal {tilde over (x)} from the quantised error signals {tilde over (E)}(k);
using the prediction coefficients A and the quantised audio signal {tilde over (x)} to generate a predicted audio signal {circumflex over (x)};
transforming the predicted audio signal {circumflex over (x)} from the time domain to the frequency domain to generate a set of predicted frequency sub-band signals {circumflex over (X)}(k) for combining with the quantised error signals {tilde over (E)}(k) to generate a set of reconstructed frequency sub-band signals {tilde over (X)}(k); and
performing a frequency to time domain transform on the reconstructed frequency sub-band signals {tilde over (X)}(k) to generate the reconstructed quantised audio signal {tilde over (x)}.
Embodiments of the above second aspect of the invention are particularly applicable where only a sub-set of all possible quantised error signals {

Affiliated with

Yin Lin

Inventor

[ 0.00 ] – not rated yet Voters 0 Comments 0

Also associated with

Abebe Daniel

Examiner

[ 0.00 ] – not rated yet Voters 0 Comments 0

Nokia Mobile Phones Limited

Corporate Assignee

[ 0.00 ] – not rated yet Voters 0 Comments 0

Perman & Green LLP

Law Firm

[ 0.00 ] – not rated yet Voters 0 Comments 0

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Audio coding method and apparatus does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Audio coding method and apparatus, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Audio coding method and apparatus will most certainly appreciate the feedback.

Rate now

Comments { 0 }

Profile ID: LFUS-PAI-O-3254104

All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.

Canada

Charities
Companies
MP Candidates
Patents
Employee Salary Disclosure

World

Places of the World
Scientific Papers

United States

Banks
Companies
Counties
Patents
Employee Salary Disclosure