Apparatus and method for speech encoding based on short-term pre

Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission

Patent

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

704219, G10L 502

Patent

active

059501553

DESCRIPTION:

BRIEF SUMMARY
TECHNICAL FIELD

This invention relates to a speech encoding method for encoding short-term prediction residuals or parameters representing short-term prediction coefficients of an input speech signal by vector or matrix quantization.


BACKGROUND ART

There are a variety of encoding methods known for encoding an audio signal, inclusive of a speech signal and an acoustic signal, by exploiting statistical properties of the audio signal in the time domain and in the frequency domain and the psychoacoustic characteristics of the human hearing system. These encoding methods may be roughly classified into encoding on the time domain, encoding on the frequency domain and analysis/synthesis encoding.
If, in multi-band excitation (MBE), single-band excitation (SBE), harmonic excitation, sub-band coding (SBC), linear predictive coding (LPC), discrete cosine transform (DCT), modified DCT (MDCT) or fast Fourier transform (FFT), as examples of high-efficiency coding for speech signals, various information data, such as spectral amplitudes or parameters thereof, such as LSP parameters, .alpha.-parameters or k-parameters, are quantized, scalar quantization has been usually adopted.
If, with such scalar quantization, the bit rate is decreased to e.g. 3 to 4 kbps to further increase the quantization efficiency, the quantization noise or distortion is increased, thus raising difficulties in practical utilization. Thus it is currently practiced to group different data given for encoding, such as time-domain data, frequency-domain data or filter coefficient data, into a vector, or to group such vectors across plural frames, into a matrix, and to effect vector or matrix quantization, in place of individually quantizing the different kinds of data.
For example, in code excitation linear prediction (CELP) encoding, LPC residuals are directly quantized by vector or matrix quantization as time-domain waveform. In addition, the spectral envelope in MBE encoding is similarly quantized by vector or matrix quantization.
If the bit rate is decreased further, it becomes infeasible to use enough bits to quantize parameters specifying the envelope of the spectrum itself or the LPC residuals, thus deteriorating the signal quality.
In view of the foregoing, it is an object of the present invention to provide a speech encoding method capable of affording satisfactory quantization characteristics even with a smaller number of bits.


DISCLOSURE OF THE INVENTION

With the speech encoding method according to the present invention, a first codebook and a second codebook are formed by assorting parameters representing short-term prediction values concerning a reference parameter comprised of one or a combination of a plurality of characteristic parameters of the input speech signal. The short-term prediction values are generated based upon the input speech signal. One of the first and second codebooks concerning the reference parameter of the input speech signal is selected and the short-term prediction values are quantized by referring to the selected codebook for encoding the input speech signal.
The short-term prediction values are short-term prediction coefficients or short-term prediction errors. The characteristic parameters include the pitch values of the speech signal, pitch strength, frame power, voiced/unvoiced discrimination flag and the gradient of the signal spectrum. The quantization is the vector quantization or the matrix quantization. The reference parameter is the pitch value of the speech signal. One of the first and second codebooks is selected in dependence upon the magnitude relationship between the pitch value of the input speech signal and a pre-set pitch value.
According to the present invention, the short-term prediction value, generated based upon the input speech signal, is quantized by referring to the selected codebook for improving the quantization efficiency.


BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a schematic block diagram showing a speech encoding device (encoder) as an illustrative example of a device fo

REFERENCES:
patent: 4791670 (1988-12-01), Copperi et al.
patent: 4811396 (1989-03-01), Yatsuzuka
patent: 4817157 (1989-03-01), Gerson
patent: 4860355 (1989-08-01), Copperi
patent: 5007092 (1991-04-01), Galand et al.
patent: 5202926 (1993-04-01), Miki
patent: 5327320 (1994-07-01), Chen
patent: 5371853 (1994-12-01), Kao et al.
patent: 5414796 (1995-05-01), Jacobs et al.
patent: 5487086 (1996-01-01), Bhaskar
patent: 5491771 (1996-02-01), Gupta et al.
patent: 5533052 (1996-07-01), Bhaskar
patent: 5546498 (1996-08-01), Sereno
patent: 5602959 (1997-02-01), Bergstrom et al.
patent: 5602961 (1997-02-01), Kolesnik et al.
patent: 5642465 (1997-06-01), Scott et al.
patent: 5651026 (1997-07-01), Lin et al.
patent: 5699481 (1997-12-01), Shlomot et al.
patent: 5699485 (1997-12-01), Shoham
patent: 5710863 (1998-01-01), Chen
patent: 5732389 (1998-03-01), Kroon et al.
Schroeder, Mangred Code Excited Linear Prediction (CELP): High Quality Speech at Very Low Bit Rates, Internation Conference on Acoustics, Speech and Signal Processing 85, vol. 3, Mar. 1985 Tampa.
Rabiner et al. Fundamentals of Speech Recogntion. 129-131. 254, 1993.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Apparatus and method for speech encoding based on short-term pre does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Apparatus and method for speech encoding based on short-term pre, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Apparatus and method for speech encoding based on short-term pre will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-1815215

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.