Method to determine the excitation pulse positions within a spee

Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission

Patent

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

704220, 704223, G10L 1910

Patent

active

060649569

DESCRIPTION:

BRIEF SUMMARY
TECHNICAL FIELD

The present invention relates to a method of determining the positions of excitation pulses in a speech frame in a linear predictive speech encoder which operates in accordance with the multiple pulse principle. Such a speech encoder may be used in a mobile telephone system for instance, to compress the speech signals prior to their transmission from a mobile station.


BACKGROUND

Linear predictive speech encoders which operate in accordance with the aforesaid multipulse principle are known to the art; see, for instance, U.S. Pat. No. 3,624,302 which describes linear predictive encoding of speech signals, and U.S. Pat. No. 3,740,476 which describes how predictive parameters and prediction residual signals can be formed in such a speech encoder.
When forming an artificial speech signal by means of linear predictive coding, there are generated from the original signal a plurality of predictive parameters (a.sub.k) which characterize the artificial speech signal. Thus, there can be formed from these parameters a speech signal which does not contain the redundancy that is normally included in natural speech and which it is unnecessary to convert in speech transmission between, e.g., a mobile and a base station in a mobile radio system. From the aspect of bandwidth, it is more suitable to transmit solely the predictive parameters instead of the original speech signal, which requires a much higher bandwidth.
However, the speech signal thus regenerated in a receiver and constituting a synthetic speech signal may be difficult to understand as a result of a lack of agreement between the speech pattern of the original signal and the synthetic signal regenerated by means of the predictive parameters. These deficiencies have been described in detail in U.S. Pat. No. 4,472,832 (SE-B-456618) and can be alleviated to some extent by introducing so-called excitation pulses (multipulses) when constructing the synthetic speech replica. This is effected by partitioning the original speech input pattern into frame intervals. There is formed within each such interval a determined number of pulses of varying amplitude and phase position (time position) in accordance with the predictive parameters a.sub.k and also in accordance with the prediction residual d.sub.k between the speech input pattern and the speech replica. Each of the pulses is able to influence the speech pattern replica such as to obtain the smallest possible prediction residual. The generated excitation pulses have a relatively low bit rate and can therefore be encoded and transmitted on a narrow band, similar to the predictive parameters. This improves the quality of the regenerated speech signal.
In the aforesaid known method, the excitation pulses are generated within each frame interval of the speech input pattern by weighting the residual signal d.sub.k and feeding back and weighting the generated values for the excitation pulses each in a predictive filter. A correlation is then effected between the output signals on the two filters and the correlation is maximized for a number of signal elements from the correlated signal, such as to form the parameters (amplitude and phase position) of the excitation pulses. The advantage afforded by this multipulse algorithm for generating the excitation pulses is that different types of sound can be generated with a small number of pulses (for instance eight pulses/frame interval). The pulse-searching algorithm is general with respect to the pulse positions within the frame. It is possible to regenerate unvoiced sounds (consonants), which generally require randomly placed pulses and voiced sounds (vowels) which require positioning of pulses to be more collected.
These known methods calculate the correct phase positions of the excitation pulses within a frame and subsequent frames of the speech signal and positioning of the pulses, so-called pulse placement, is effected solely in dependence on complex processing of the speech signal parameters (prediction residuals, residual signal and the excitation pu

REFERENCES:
patent: 3624302 (1971-11-01), Atal
patent: 3740476 (1973-06-01), Atal
patent: 4472832 (1984-09-01), Atal et al.
patent: 4701954 (1987-10-01), Atal
patent: 4736428 (1988-04-01), Deprettere et al.
patent: 4932061 (1990-06-01), Kroon et al.
patent: 4944013 (1990-07-01), Gouvianakis et al.
patent: 5091944 (1992-02-01), Takahashi
patent: 5193140 (1993-03-01), Minde
patent: 5687284 (1997-11-01), Serizawa et al.
patent: 5737484 (1998-04-01), Ozawa
patent: 5754976 (1998-05-01), Adoul et al.
patent: 5937376 (1999-08-01), Minde

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Method to determine the excitation pulse positions within a spee does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Method to determine the excitation pulse positions within a spee, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method to determine the excitation pulse positions within a spee will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-267543

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.