Coded data generation or conversion – Digital code to digital code converters
Reexamination Certificate
1998-05-07
2002-03-12
JeanPierre, Peguy (Department: 2819)
Coded data generation or conversion
Digital code to digital code converters
C704S500000
Reexamination Certificate
active
06356211
ABSTRACT:
BACKGROUND OF THE INVENTION
1. Field of the Invention
This invention relates to an encoding method and apparatus, suitable for encoding input signals by high efficiency encoding and for reproducing playback signals on transmission, recording, reproduction and decoding, and a recording medium.
2. Description of the Related Art
There has so far been proposed an information recording medium capable of recording signals such as the encoded acoustic information or the music information (referred to hereinafter as audio signals), such as a magneto-optical disc. Among methods for high-efficiency encoding of the audio signals, there are a so-called transform coding which is a blocking frequency spectrum splitting method of transforming a time-domain signal into frequency domain signals by orthogonal transform and encoding the spectral components from one frequency band to another, and a sub-band encoding (SBC) method, which is a non-blocking frequency spectrum splitting method of splitting the time-domain audio signals into plural frequency bands without blocking and encoding the resulting signals of the frequency bands. There is also known a high-efficiency encoding technique which is a combination of the sub-band coding and transform coding, in which case the time domain signals are split into plural frequency bands by SBC and the resulting band signals are orthogonal transformed into spectral components which are encoded from band to band.
Among the above-mentioned filters is a so-called QMF (Quadrature Mirror Filter) as discussed in R.E. Crochiere, Digital Coding of Speech in subbands, Bell Syst. Tech. J. Vol.55, No.8, 1976. This QMF filter splits the frequency spectrum into two bands of equal bandwidths and is characterized in that so-called aliasing is not produced on subsequently synthesizing the split bands. The technique of dividing the frequency spectrum is discussed in Joseph H. Rothweiler, Polyphase Quadrature Filters- A New Subband Coding Technique, ICASSP 83 BOSTON. This polyphase quadrature filter is characterized in that the signal can be split at a time into plural bands of equal band-width.
Among the above-mentioned techniques for orthogonal transform is such a technique in which an input audio signal is blocked every pre-set unit time, such as every frame, and discrete fourier transform (DFT), discrete cosine transform (DCT) or modified DCT (MDCT) is applied to each block for converting the signals from the time axis to the frequency axis. Discussions of the MDCT are found in J. P. Princen and A. B. Bradley, Subband/Transform coding Using Filter Bank Based on Time Domain Aliasing Cancellation, ICASSP 1987.
If the above-mentioned DFT or DCT is used as a method for transforming waveform signals into spectral signals, and transform is applied based on a time block composed of M samples, M independent real-number data are obtained. It is noted that, for reducing junction distortions between time blocks, a given time bock is usually overlapped with MI samples with both neighboring blocks, and M real-number data on an average are quantized and encoded in DFT or DCT for (M−M
1
) samples. It is these M real-number data that are subsequently quantized and encoded.
On the other hand, if the above-mentioned MDCT is used as a method for orthogonal transform, M independent real-number data are obtained from 2M samples overlapped with M samples of both neighboring time blocks. Thus, in MDCT, M real-number data on an average are obtained for M samples and subsequently quantized and encoded. A decoding device adds waveform elements obtained on inverse transform in each block from the codes obtained by MDCT with interference for re-constructing the waveform signals.
In general, if a time block for transform is lengthened, the spectrum frequency resolution is improved such that the signal energy is concentrated in specified frequency components. Therefore, by using MDCT in which, by overlapping with one half of each of both neighboring blocks, transform is carried out with long block lengths, and in which the number of the resulting spectral signals is not increased beyond the number of the original time samples, encoding can be carried out with higher efficiency than if DFT or DCT is used. Moreover, since the neighboring blocks have sufficiently long overlap with each other, the inter-block distortion of the waveform signals can be reduced. However, if the transform block length for transform is lengthened, more work area is required for transform, thus obstructing reduction in size of reproducing means. In particular, use of a long transform block at a time point when it is difficult to raise the integration degree of a semiconductor should be avoided since this increases the manufacturing cost.
By quantizing signals split into plural frequency bands by a filter or orthogonal transform, the frequency band in which occurs the quantization noise can be controlled so that encoding can be achieved with psychoacoustic higher efficiency by using acoustic characteristics such as masking effects. If the signal components are normalized with the maximum values of the absolute values of the signal components in the respective bands, encoding can be achieved with still higher efficiency.
As frequency band widths in case of quantizing the frequency components, obtained on splitting the frequency spectrum, it is known to split the frequency spectrum such as to take account of the psychoacoustic characteristics of the human auditory system. Specifically, the audio signals are divided into a plurality of, such as 25, bands using bandwidths increasing with increasing frequency. These bands are known as critical bands. In encoding the band-based data, encoding is carried out by fixed or adaptive bit allocation on the band basis. In encoding coefficient data obtained by MDCT processing by bit allocation as described above, encoding is by an adaptive number of bit allocation for band-based MDCT coefficients obtained by block-based MDCT processing. As these bit allocation techniques, there are known the following two techniques.
For example, in R. Zelinsky and P. Noll, Adaptive Transform Coding of Speech Signals and in ‘IEEE Transactions of Acoustics, Speech and Signal Processing, vol. ASSP-25, No.4, August 1977, bit allocation is performed on the basis of the magnitude of the band-based signals. With this system, the quantization noise spectrum becomes flat, such that the quantization noise is minimized. However, the actual noise feeling is not psychoacoustically optimum because the psychoacoustic masking effect is not exploited.
In a publication ‘ICASSP 1980, The critical band coder—digital encoding of the perceptual requirements of the auditory system, M. A. Krasner, MIT’, the psychoacoustic masking mechanism is used to determine a fixed bit allocation that produces the necessary signal-to-noise ratio for each critical band. However, if this technique is used to measure characteristics of a sine wave input, non-optimum results are obtained because of the fixed allocation of bits among the critical bands.
For overcoming these problems, there is proposed a high-efficiency encoding device in which a portion of the total number of bits usable for bit allocation is used for a fixed bit allocation pattern pre-fixed from one small block to another and the remaining portion is used for bit allocation dependent on the signal amplitudes of the respective blocks, and in which the bit number division ratio between the fixed bit allocation and the bit allocation dependent on the signal amplitudes is made dependent on a signal related to an input signal, such that the bit number division ratio to the fixed bit allocation becomes larger the smoother the signal spectrum.
This technique significantly improves the signal-to-noise ratio on the whole by allocating more bits to a block including a particular signal spectrum exhibiting concentrated signal energy. By using the above techniques, for improving the signal-to-noise ratio characteristics, not only the measured values are increased, but also the sound a
Shimoyoshi Osamu
Tsutsui Kyoya
Sonneschein Nath & Rosenthal
Sony Corporation
LandOfFree
Encoding method and apparatus and recording medium does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Encoding method and apparatus and recording medium, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Encoding method and apparatus and recording medium will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2866877