Information encoding method and apparatus, information...

Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C704S201000, C704S211000

Reexamination Certificate

active

06314391

ABSTRACT:

BACKGROUND OF THE INVENTION
1. Field of the Invention
This invention relates to an information encoding method and apparatus, suitable for expanding the format of the encoded signals, an information decoding method and apparatus, as counterparts of the information encoding method and apparatus, and an information recording medium having the encoded information recorded thereon.
2. Description of the Related Art
There has so far been proposed an information recording medium capable of recording signals such as encoded acoustic information or music information (referred to hereinafter as audio signals), such as a magneto-optical disc. Among methods for high-efficiency encoding of the audio signals, there are a so-called transform coding which is a blocking frequency spectrum splitting method of transforming a time-domain signal into frequency domain signals by orthogonal transform and encoding the spectral components from one frequency band to another, and a sub-band encoding (SBC) method, which is a non-blocking frequency spectrum splitting method of splitting the time-domain audio signals into plural frequency bands without blocking and encoding the resulting signals of the frequency bands. There is also known a high-efficiency encoding technique which is a combination of the sub-band coding and transform coding, in which case the time domain signals are split into plural frequency bands by SBC and the resulting band signals are orthogonally transformed into spectral components which are encoded from band to band.
Among the above-mentioned filters is a so-called QMF filter as discussed in 1976, R. E. Crochiere, Digital Coding of Speech in subbands, Bell Syst. Tech. J. Vol. 55, No. 8, 1976. This QMF filter splits the frequency spectrum into two bands of equal bandwidths and is characterized in that so-called aliasing is not produced on subsequently synthesizing the split bands. The technique of dividing the frequency spectrum is discussed in Joseph H. Rothweiler, Polyphase Quadrature Filters—A New Subband Coding Technique, ICASSP 83 BOSTON. This polyphase quadrature filter is characterized in that the signal can be split into plural bands of equal band-width.
Among the above-mentioned techniques for orthogonal transform is such a technique in which an input audio signal is blocked every pre-set unit time, such as every frame, and discrete Fourier transform (DFT), discrete cosine transform (DCT) or modified DCT (MDCT) is applied to each block for converting the signals from the time axis to the frequency axis. Discussions of the MDCT are found in J. P. Princen and A. B. Bradley, Subband/Transform Coding Using Filter Bank Designs Based on Time Domain Aliasing Cancellation, ICASSP 1987.
If the above-mentioned DFT or DCT is used as a method for transforming waveform signals into spectral signals, and a transform is applied based on a time block composed of M samples, M independent real-number data are obtained. It is noted that, for reducing junction distortions between time blocks, a given time block is usually overlapped with M
1
samples with both neighboring blocks, and M real-number data on an average are quantized and encoded in DFT or DCT for (M-M
1
) samples. It is these M real-number data that are subsequently quantized and encoded.
On the other hand, if the above-mentioned MDCT is used as a method for orthogonal transform, M independent real-number data are obtained from 2M samples overlapped with N samples of both neighboring time blocks. Thus, in MDCT, M real-number data on an average are obtained for M samples and subsequently quantized and encoded. A decoding device adds waveform elements obtained on inverse transform in each block from the codes obtained by MDCT with interference for re-constructing the waveform signals.
In general, if a time block for a transform is lengthened, the spectrum frequency resolution is improved such that the signal energy is concentrated in specified frequency components. Therefore, by using MDCT in which, by overlapping with one half of each of both neighboring blocks, transform is carried out with long block lengths, and in which the number of the resulting spectral signals is not increased beyond the number of the original time samples, encoding can be carried out with higher efficiency than if the DFT or DCT is used. Moreover, since the neighboring blocks have a sufficiently long overlap with each other, the inter-block distortion of the waveform signals can be reduced. However, if the transform block length for a transform is lengthened, more work area is required for the transform, thus making a reduction in size of a reproducing means more difficult.
By quantizing signals split into plural frequency bands by a filter or orthogonal transform, the frequency band in which the quantization noise occurs can be controlled so that encoding can be achieved with higher psychoacoustic efficiency by exploiting acoustic characteristics such as masking effects. If the signal components are normalized with the maximum values of the absolute values of the signal components in the respective bands, encoding can be achieved with still higher efficiency.
As frequency band widths in case of quantizing the frequency components, obtained on splitting the frequency spectrum, it is known to split the frequency spectrum such as to take account of the psychoacoustic characteristics of the human auditory system. Specifically, the audio signals are divided into a plurality of, such as 25, bands using bandwidths increasing with increasing frequency. These bands are known as critical bands. In encoding the band-based data, encoding is carried out by fixed or adaptive bit allocation on the band basis. In encoding coefficient data obtained by MDCT processing by bit allocation as described above, encoding is by an adaptive number of bit allocation for band-based MDCT coefficients obtained by block-based MDCT processing. Among the prior art bit allocation techniques, there are known the following two techniques.
For example, in R. Zelinsky and P. Noll, Adaptive Transform Coding of Speech Signals and in “IEEE Transactions of Acoustics, Speech and Signal Processing”, vol. ASSP-25, No. 4, August 1977, bit allocation is performed on the basis of the magnitude of the band-based signals. With this system, the quantization noise spectrum becomes flat, such that the quantization noise is minimized. However, the actual noise feeling is not psychoacoustically optimum because the psychoacoustic masking effect is not exploited.
In a publication “ICASSP 1980, The Critical Band Coder-Digital Encoding of the Perceptual Requirements of the Auditory System”, M. A. Krasner, MIT, the psychoacoustic masking mechanism is used to determine a fixed bit allocation that produces the necessary signal-to-noise ratio for each critical band. However, if this technique is used to measure characteristics of a sine wave input, non-optimum results are obtained because of the fixed allocation of bits among the critical bands.
For overcoming these problems, there is proposed a high-efficiency encoding device in which a portion of the total number of bits usable for bit allocation is used for a fixed bit allocation pattern pre-fixed from one small block to another and the remaining portion is used for bit allocation dependent on the signal amplitudes of the respective blocks, and in which the bit number division ratio between the fixed bit allocation and the bit allocation dependent on the signal amplitudes is made dependent on a signal related to an input signal, such that the bit number division ratio to the fixed bit allocation becomes larger if the signal spectrum is smoother.
This technique significantly improves the signal-to-noise ratio on the whole by allocating more bits to a block including a particular signal spectrum exhibiting concentrated signal energy. By using the above techniques, for improving the signal-to-noise ratio characteristics, not only are the measured values increased, but also the sound as perceived by the listener is improved in signal quality, because the human audit

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Information encoding method and apparatus, information... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Information encoding method and apparatus, information..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Information encoding method and apparatus, information... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2571227

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.