Frequency-domain scalable coding without upsampling filters

Data processing: speech signal processing – linguistics – language – Audio signal bandwidth compression or expansion

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C704S205000, C704S230000

Reexamination Certificate

active

06370507

ABSTRACT:

FIELD OF THE INVENTION
The present invention relates to methods of and apparatus for coding discrete signals and decoding coded discrete signals, respectively, and in particular to implementing differential coding for scalable audio coders in efficient manner.
BACKGROUND ART AND DESCRIPTION OF PRIOR ART
Scalable audio coders are coders of modular construction. There are endeavors to employ existing speech coders capable of processing signals, which are sampled e.g. with 8 kHz, and of outputting data rates of, for example, 4.8 to 8 kilobit per second. These known coders, such as e.g. the coders G.729, G.723, FS1016 and CELP known to experts, serve mainly for coding speech signals and in general are not suitable for coding higher-quality music signals since they are usually designed for signals sampled with 8 kHz, so that they can code only an audio bandwidth of 4 kHz at maximum. However, in general they exhibit faster operation and low calculating expenditure.
For audio coding of music signals, in order to obtain for example HIFI quality or CD quality, a scalable coder thus employs a combination of a speech coder and an audio coder that is capable of coding signals with a higher sampling rate, such as e.g. 48 kHz. It is of course also possible to replace the above-mentioned speech coder by a different coder, for example a music/audio coder according to the standards MPEG1, MPEG2 or MPEG3.
Such a cascade connection of a speech coder with a higher-grade audio coder usually employs the method of differential coding in the time domain. An input signal having e.g. a sampling rate of 48 kHz is downsampled to the sampling frequency suitable for the speech coder by means of a downsampling filter. The downsampled signal is then coded. The coded signal can be fed directly to a bit stream formatting means for transmission thereof. However, it contains only signals with a bandwidth of e.g. 4 kHz at maximum. The coded signal, furthermore, is decoded again and upsampled by means of an upsampling filter. However, due to the downsampling filter, the signal then obtained contains only useful information with a bandwidth of e.g. 4 kHz. Furthermore, it is to be noted that the spectral content of the upsampled coded/decoded signal in the lower band range up to 4 kHz does not correspond exactly to the first 4 kHz band of the input signal sampled with 48 kHz, since coders in general introduce coding errors (cf. “First Ideas on Scalable Audio Coding”, K. Brandenburg, B. Grill, 97th AES-Convention, San Francisco, 1994, Preprint 3924).
As was already pointed out, a scalable coder comprises both a generally known speech coder and an audio coder that is capable of processing signals with higher sampling rates. In order to be able to transmit signal components of the input signal having frequencies above 4 kHz, a difference is formed of the input signal with 8 kHz and the coded/decoded upsampled output signal of the speech coder for each individual time-discrete sampled value. This difference then may be quantized and coded by means of a known audio coder, as known to experts. It is to be noted here that the differential signal fed into the audio coder capable of coding signals with higher sampling rates, is substantially zero in the lower frequency range, leaving apart coding errors of the speech coder. In the spectral range above the bandwidth of the upsampled coded/decoded output signal of the speech coder, the differential signal substantially corresponds to the true input signal at 48 kHz.
In the first stage, i.e. the stage of the speech coder, a coder with low sampling frequency is thus used mostly, since in general a very low bit rate of the coded signal is aimed at. At present, there are several coders, also the coders mentioned, operating with bit rates of a few kilobit (two to eight kilobit or also above). The same coders, furthermore, permit a maximum sampling frequency of 8 kHz, since a greater audio bandwidth is not possible anyway with such a low bit rate and since coding with a low sampling frequency is more advantageous as regards the calculating expenditure. The maximum possible audio bandwidth is 4 kHz and in practical application is restricted to about 3.5 kHz. In case a bandwidth improvement is to be achieved then in the additional stage, i.e. in the stage including the audio coder, this additional stage will have to operate with a higher sampling frequency.
For matching the sampling frequencies, decimation and interpolation filters are used for downsampling and upsampling, respectively. As FIR filters (FIR=Finite Impulse Response) are used in general for obtaining an advantageous phase behavior, filter arrangements of several hundred coefficients or “taps” can be required e.g. for matching from 8 kHz to 48 kHz.
SUMMARY OF THE INVENTION
It is the object of the present invention to provide methods of an apparatus for coding discrete signals and decoding coded discrete signals, respectively, which are capable of operating without complex upsampling filters.
This object is met by a method of coding according to claim
1
, a method of decoding according to claim
13
, an apparatus for coding according to claim
14
, and an apparatus for decoding according to claim
15
.
In accordance with a first aspect of the present invention, the object is met by a method of coding discrete first time signals sampled with a first sampling rate, by firstly generating second time signals, having a bandwidth corresponding to a second sampling rate, from the first time signals, with the second sampling rate being lower than the first sampling rate, secondly, coding the second time signals in accordance with a first coding algorithm in order to obtain coded second signals, third, decoding the coded second signals in accordance with the first coding algorithm in order to obtain coded/decoded second time signals having a bandwidth corresponding to the second sampling frequency, fourth, transforming the first time signals to the frequency domain to obtain first spectral values, fifth, generating second spectral values from the coded/decoded second time signals, the second spectral values being a representation of the coded/decoded second time signals in the frequency domain and having a time and frequency resolution substantially equal to the first spectral values, sixth, weighting the first spectral values with the second spectral values in order to obtain weighted spectral values which in number correspond to the number of the first spectral values, and coding the weighted spectral values in accordance with a second coding algorithm in order to obtain coded weighted spectral values.
Weighting the first spectral values and the second spectral values comprises the subtraction of the second spectral values from the first spectral values in to obtain differential spectral values.
In accordance with a second aspect of the present invention the above object is met by a method of decoding a coded discrete signal, by firstly decoding coded second signals to obtain coded/decoded second discrete time signals, with a first coding algorithm, secondly, decoding coded weighted spectral values with a second coding algorithm, to obtain weighted spectral values, thirdly, transforming the coded/decoded second discrete time signals to the frequency domain in order to obtain second spectral values, fourth, inversely weighting the weighted spectral values and the second spectral values to obtain first spectral values and retransforming the first spectral values to the time domain in order to obtain first discrete time signals.
In accordance with a third aspect of the present invention the above object is met by an apparatus for coding discrete first time signals sampled with a first sampling rate. The apparatus comprises several parts, such as, a generating device for generating second time signals, having a bandwidth corresponding to a second sampling rate, from the first time signals, with the second sampling rate being lower than the first sampling rate, a first coder for coding the second time signals in accordance wi

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Frequency-domain scalable coding without upsampling filters does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Frequency-domain scalable coding without upsampling filters, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Frequency-domain scalable coding without upsampling filters will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2829508

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.