Speech encoding method and speech encoding system

Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C704S501000, C704S207000

Reexamination Certificate

active

06581031

ABSTRACT:

FIELD OF THE INVENTION
This invention relates to a speech encoding method and a speech encoding system used to encode voice signal in high quality at a low bit rate.
BACKGROUND OF THE INVENTION
Known as a method of encoding voice signal in high efficiency is CELP (code excited linear predictive coding) described in, for example, M. Schroeder and B. Atal, “Code-Excited Linear Prediction: High Quality Speech at Very Low Bit Rates”, Proc. ICASSP, pp.937-940, 1985 (prior art 1), and Kleij et al., “Improved Speech Quality and Efficient Vector Quantization in SELP”, Proc. ICASSP, pp.155-158, 1988 (prior art 2).
In CELP, on the transmission side, for each frame, e.g. 20 ms, spectral parameter to spectral characteristic is extracted from speech signal by using LPC (linear predictive coding) analysis. A frame is further divided into subframes, e.g. 5 ms, and for each subframe, based on past excitation signal, parameters (delay. parameter and gain parameter corresponding to pitch cycle) at adaptive codebook are extracted, and speech signal of the subframe is pitch-predicted by the adaptive codebook. For excitation signal obtained by the pitch-predicting, an optimum sound-source code vector is selected from a sound-source codebook (vector quantization codebook) composed of a predetermined kind of noise signals, and the excitation signal is quantized by calculating optimum gain. The selection of sound-source code vector is conducted so that the error electric power between signal synthesized by the selected noise signal and residual signal can be minimized. Then, the index and gain to indicate the kind of code vector selected, the spectral parameter and the adaptive codebook parameter are combined by a multiplexer and transmitted.
However, in CELP described above, there is a problem that when the delay of adaptive codebook extracted for current subframe is more than an integer times or less than the inverse number of an integer times, where the integer is two or more, the delay of adaptive codebook calculated for the previous subframe, between the previous codebook and current codebook, the delay of adaptive codebook becomes discontinuous and therefore the tone quality deteriorates. The reason is as follows: although the delay of adaptive codebook extracted for current subframe is searched near a pitch cycle calculated from speech signal by a pitch calculator, when the pitch cycle becomes more than an integer times or less than the inverse number of an integer times the delay of adaptive codebook calculated for the previous subframe, the search range of adaptive codebook for the current subframe does not include near the delay of adaptive codebook for the previous subframe. Therefore, between the previous codebook and current codebook, the delay of adaptive codebook becomes discontinuous in the process of time.
SUMMARY OF THE INVENTION
Accordingly, it is an object of the invention to provide a speech encoding method and a speech encoding system that the delay of adaptive codebook calculated for each subframe can be prevented from being discontinuous in the process of time.
According to the invention, a speech encoding method, comprises the steps of:
calculating a spectral parameter from speech signal to be input and quantizing the spectral parameter;
calculating delay and gain from excitation signal quantized in the past according to an adaptive codebook and calculating the residual by predicting speech signal, based on a pitch cycle;
quantizing the excitation signal of the speech signal by using the spectral parameter;
quantizing the gain of the excitation signal; and
limiting the search range in searching the pitch cycle based on the delay of adaptive codebook calculated in the past and searching the pitch cycle from the speech signal.
According to another aspect of the invention, a speech encoding method, comprises the steps of:
calculating a spectral parameter from speech signal to be input and quantizing the spectral parameter;
calculating delay and gain from excitation signal quantized in the past according to an adaptive codebook and calculating the residual by predicting speech signal, based on a pitch cycle;
quantizing the excitation signal of the speech signal by using the spectral parameter;
quantizing the gain of the excitation signal;
determining a mode by extracting a characteristic quantity from the speech signal; and
limiting the search range in searching the pitch cycle based on the delay of adaptive codebook calculated in the past and searching the pitch cycle from the speech signal, when the determined mode corresponds to a predetermined mode.
According to another aspect of the invention, a speech encoding system, comprises:
a spectral parameter calculation unit that calculates a spectral parameter from speech signal to be input and quantizes the spectral parameter;
a pitch calculation unit that outputs calculating a pitch cycle from the speech signal;
an adaptive codebook unit that calculates delay and gain from excitation signal quantized in the past according to an adaptive codebook and calculates the residual by predicting speech signal, based on the output of the pitch calculation unit, and that outputs the calculated delay and gain;
a excitation quantization unit that outputs quantizing the excitation signal of the speech signal by using the spectral parameter;
a gain quantization unit that outputs quantizing the gain of the excitation signal; and
a limiter unit that limits the search range in searching the pitch cycle based on the delay of adaptive codebook calculated in the past;
wherein the pitch calculation unit outputs searching the pitch cycle based on the output of the limiter unit.
According to another aspect of the invention, a speech encoding system, comprises:
a spectral parameter calculation unit that calculates a spectral parameter from speech signal to be input and quantizes the spectral parameter;
a pitch calculation unit that outputs calculating a pitch cycle from the speech signal;
an adaptive codebook unit that calculates multiple delays and gain from excitation signal quantized in the past according to an adaptive codebook and calculates the residual by predicting speech signal, based on the output of the pitch calculation unit, and that outputs the calculated delays and gain;
a excitation quantization unit that quantizes the excitation signal of the speech signal for each of the multiple delays by using the spectral parameter and then outputs selecting one with smaller signal distortion;
a gain quantization unit that outputs quantizing the gain of the excitation signal; and
a limiter unit that limits the search range in searching the pitch cycle based on the delay of adaptive codebook calculated in the past;
wherein the pitch calculation unit outputs searching the pitch cycle based on the output of the limiter unit.
According to another aspect of the invention, a speech encoding system, comprises:
a spectral parameter calculation unit that calculates a spectral parameter from speech signal to be input and quantizes the spectral parameter;
a pitch calculation unit that outputs calculating a pitch cycle from the speech signal;
an adaptive codebook unit that calculates delay and gain from excitation signal quantized in the past according to an adaptive codebook and calculates the residual by predicting speech signal, based on the output of the pitch calculation unit, and that outputs the calculated delay and gain;
a excitation quantization unit that outputs quantizing the excitation signal of the speech signal by using the spectral parameter;
a mode determination unit that determines a mode by extracting a characteristic quantity from the speech signal;
a gain quantization unit that outputs quantizing the gain of the excitation signal; and
a limiter unit that limits the search range in searching the pitch cycle based on the delay of adaptive codebook calculated in the past, when the output of the mode determination unit corresponds to a predetermined mode;
wherein the pitch calculation unit outputs searching the pitch cycle based on the output of

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Speech encoding method and speech encoding system does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Speech encoding method and speech encoding system, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Speech encoding method and speech encoding system will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3155498

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.