Split band linear prediction vocoder with pitch extraction

Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C704S219000

Reexamination Certificate

active

06526376

ABSTRACT:

This invention relates to speech coders.
The invention finds particular, though not exclusive, application in telecommunications systems.
According to one aspect of the invention there is provided a speech coder including an encoder for encoding an input speech signal divided into frames each consisting of a predetermined number of digital samples, the encoder including: linear predictive coding (LPC) means for analysing samples and generating at least one set of linear prediction coefficients for each frame; pitch determination means for determining at least one value of pitch for each frame, the pitch determination means including first estimation means for analysing samples using a frequency domain technique (frequency domain analysis), second estimation means for analysing samples using a time domain technique (time domain analysis) and pitch evaluation means for using the results of said frequency domain and time domain analyses to derive a said value of pitch; voicing means for defining a measure of voiced and unvoiced signals in each frame; amplitude determination means for generating amplitude information for each frame, and quantisation means for quantising said set of linear prediction coefficients, said value of pitch, said measure of voiced and unvoiced signals and said amplitude information to generate a set of quantisation indices for each frame, wherein said first estimation means generates a first measure of pitch for each of a number of candidate pitch values, the second estimation means generates a respective second measure of pitch for each of said candidate pitch values and said evaluation means combines each of at least some of the first measures with the corresponding said second measure and selects one of the candidate pitch values by reference to the resultant combinations.
According to another aspect of the invention there is provided a speech coder including an encoder for encoding an input speech signal, the encoder comprising means for sampling the input speech signal to produce digital samples and for dividing the samples into frames each consisting of a predetermined number of samples, linear predictive coding (LPC) means for analysing samples and generating at least one set of linear prediction coefficients for each frame, pitch determination means for determining at least one value of pitch for each frame, voicing means for defining a measure of voiced and unvoiced signals in each frame, amplitude determnination means for generating amplitude information for each frame, and quantisation means for quantising said set of linear prediction coefficients, said value of pitch, said measure of voiced and unvoiced signals and said amplitude information to generate a set of quantisation indices for each frame, wherein said pitch determination means includes pitch estimation means for determining an estimate of the value of pitch and pitch refinement means for deriving the value of pitch from the estimate, the pitch refinement means defining a set of candidate pitch values including fractional values distributed about said estimate of the value of pitch determined by the pitch estimation means, identifying peaks in a frequency spectrum of the frame, for each said candidate pitch value correlating said peaks with amplitudes at different harmonic frequencies (k&ohgr;
o
) of a frequency spectrum of the frame, where
ω
o
=
2

π
P
,
P is a said candidate pitch value and k is an integer, and selecting as a said value of pitch the candidate pitch value giving the maximum correlation.
According to a further aspect of the invention there is provided a speech coder including an encoder for encoding an input speech signal, the encoder comprising means for sampling the input speech signal to produce digital samples and for dividing the samples into frames, each consisting of a predetermined number of samples, linear predictive coding (LPC) means for analysing samples and generating at least one set of linear prediction coefficients for each frame, pitch determination means for determining at least one value of pitch for each frame, voicing means for determining for each frame a voicing cut-off frequency for separating a frequency spectrum from the frame into a voiced part and an unvoiced part without evaluating the voiced/unvoiced status of individual harmonic frequency bands, amplitude determination means for generating amplitude information for each frame, and quantisation means for quantising said set of coefficients, said value of pitch, said voicing cut-off frequency and said amplitude information to generate a set of quantisation indices for each frame.
According to a yet further aspect of the invention there is provided a speech coder including an encoder for encoding an input speech signal, the encoder comprising, means for sampling the input speech signal to produce digital samples and for dividing the samples into frames each consisting of a predetermined number of samples, linear predictive coding (LPC) means for analysing samples and generating at least one set of linear prediction coefficients for each frame, pitch determination means for determining at least one value of pitch for each frame, voicing means for defining a measure of voiced and unvoiced signals in each frame, amplitude determination means for generating amplitude information for each frame, and quantisation means for quantising said set of prediction coefficients, said value of pitch, said measure of voiced and unvoiced signals and said amplitude information to generate a set of quantisation indices for each frame, wherein the amplitude determination means generates, for each frame, a set of spectral amplitudes for frequency bands centred on frequencies harmonically related to the value of pitch determined by the pitch determination means, and the quantisation means quantises the normalised spectral amplitudes to generate a first part of an amplitude quantisation index.
According to a yet further aspect of the invention there is provided a speech coder including an encoder for encoding an input speech signal, the encoder comprising means for sampling the input speech signal to produce digital samples and for dividing the samples into frames each consisting of a predetermined number of samples, linear predictive coding means for analysing samples to generate a respective set of Line Spectral Frequency (LSF) coefficients for a leading part and for a trailing part of each frame, pitch determination means for determining at least one value of pitch for each frame, voicing means for defining a measure of voiced and unvoiced signals in each frame, amplitude determination means for generating amplitude information for each frame, and quantisation means for quantising said sets of LSF coefficients, said value of pitch, said measure of voiced and unvoiced signals and said amplitude information to generate a set of quantisation indices, wherein said quantisation means defines a set of quantised LSF coefficients (LSF′
2
) for the leading part of the current frame by the expression
LSF

2
=&agr;
LSF

1
+(1−&agr;)
LSF

3
,
where LSF′
3
and LSF′
1
are respectively sets of quantised LSF coefficients for the trailing parts of the current frame and the frame immediately preceding the current frame, and &agr; is a vector in a first vector quantisation codebook, defines each said set of quantised LSF coefficients LSF′
2
,LSF′
3
for the leading and trailing parts respectively of the current frame as a combination of respective LSF quantisation vectors Q
2
,Q
3
of a second vector quantisation codebook and respective prediction values P
2
,P
3
, where P
2
=&lgr;Q
1
and P
3
=&lgr;Q
2
, &lgr; is a constant and Q
1
is a said LSF quantisation vector for the trailing part of said immediately preceding frame, and selects said vector Q
3
and said vector a from the first and second vector quantisation codebooks respectively to minimise a measure of distortion between the LSF coefficients generated by the linear pre

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Split band linear prediction vocoder with pitch extraction does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Split band linear prediction vocoder with pitch extraction, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Split band linear prediction vocoder with pitch extraction will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3181041

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.