Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission
Reexamination Certificate
1998-09-18
2002-12-10
Banks-Harold, Marsha D. (Department: 2654)
Data processing: speech signal processing, linguistics, language
Speech signal processing
For storage or transmission
C704S220000, C704S225000
Reexamination Certificate
active
06493665
ABSTRACT:
INCORPORATION BY REFERENCE
The following applications are hereby incorporated herein by reference in their entirety and made part of the present application:
1) U.S. Provisional Application Ser. No. 60/097,569, entitled “Adaptive Rate Speech Codec” filed Aug. 24, 1998;
2) U.S. patent application Ser. No. 09/154,675, entitled “Speech Encoder Using Continuous Warping In Long Term Preprocessing,” filed Sep. 18, 1998;
3) U.S. patent application Ser. No. 09/156,814, entitled “Completed Fixed Codebook For Speech Encoder,” filed Sep. 18, 1998;
4) U.S. patent application Ser. No. 09/156,649, entitled “Comb Codebook Structure,” filed Sep. 18, 1998;
5) U.S. patent application Ser. No. 09/156,648, entitled “Low Complexity Random Codebook Structure,” filed Sep. 18, 1998;
6) U.S. patent application Ser. No. 09/156,650, entitled “Speech Encoder Using Gain Normalization That Combines Open And Closed Loop Gains,” filed Sep. 18, 1998;
7) U.S. patent application Ser. No. 09/156,832, entitled “Speech Encoder Using Voice Activity Detection In Coding Noise,” filed Sep. 18, 1998;
8) U.S. patent application Ser. No. 09/154,654, entitled “Pitch Determination Using Speech Classification And Prior Pitch Estimation,” filed Sep. 18, 1998;
9) U.S. patent application Ser. No. 09/154,657, entitled “Speech Encoder Using A Classifier For Smoothing Noise Coding,” filed Sep. 18, 1998;
10) U.S. patent application Ser. No. 09/156,826, entitled “Adaptive Tilt Compensation For Synthesized Speech Residual,” filed Sep. 18, 1998;
11) U.S. patent application Ser. No. 09/154,653, entitled “Synchronized Encoder-Decoder Frame Concealment Using Speech Coding Parameters,” filed Sep. 18, 1998;
12) U.S. patent application Ser. No. 09/154,663, entitled “Adaptive Gain Reduction To Produce Fixed Codebook Target Signal,” filed Sep. 18, 1998;
13) U.S. patent application Ser. No. 09/154,660, entitled “Speech Encoder Adaptively Applying Pitch Long-Term Prediction and Pitch Preprocessing With Continuous Warping,” filed Sep. 18, 1998.
MICROFICHE REFERENCE
A microfiche appendix B is included of a computer program listing. The total number of microfiche is 1. The total number of frames is 24.
BACKGROUND
1. Technical Field
The present invention relates generally to speech encoding and decoding in voice communication systems; and, more particularly, it relates to various techniques used with code-excited linear prediction coding to obtain high quality speech reproduction through a limited bit rate communication channel.
2. Related Art
Signal modeling and parameter estimation play significant roles in communicating voice information with limited bandwidth constraints. To model basic speech sounds, speech signals are sampled as a discrete waveform to be digitally processed. In one type of signal coding technique called LPC (linear predictive coding), the signal value at any particular time index is modeled as a linear function of previous values. A subsequent signal is thus linearly predictable according to an earlier value. As a result, efficient signal representations can be determined by estimating and applying certain prediction parameters to represent the signal.
Applying LPC techniques, a conventional source encoder operates on speech signals to extract modeling and parameter information for communication to a conventional source decoder via a communication channel. Once received, the decoder attempts to reconstruct a counterpart signal for playback that sounds to a human ear like the original speech.
A certain amount of communication channel bandwidth is required to communicate the modeling and parameter information to the decoder. In embodiments, for example where the channel bandwidth is shared and real-time reconstruction is necessary, a reduction in the required bandwidth proves beneficial. However, using conventional modeling techniques, the quality requirements in the reproduced speech limit the reduction of such bandwidth below certain levels.
Further limitations and disadvantages of conventional systems will become apparent to one of skill in the art after reviewing the remainder of the present application with reference to the drawings.
SUMMARY OF THE INVENTION
Various aspects of the present invention can be found in an embodiment of a speech encoder using an analysis by synthesis approach on a speech signal. The speech encoder comprises an adaptive codebook, a plurality of subcodebooks, and an encoder processing circuit. The encoder processing circuit utilizes parameters of the speech signal in favoring at least one of the plurality of subcodebooks over at least one other of the plurality of subcodebooks. Of course many variations are possible. For example, the speech signal may comprise a residual signal, and at least one of the parameters utilized may comprise pitch correlation, average to peak ratio, signal to noise ratio, etc. Similarly, the encoder processing circuit may also utilize a classifier in favoring the at least one of the plurality of subcodebooks over the at least one other of the plurality of subcodebooks.
Further aspects may be found in a similar speech encoder that also uses an analysis by synthesis approach on a speech signal. The speech encoder comprises a noise-like codebook, a pulse codebook, and an encoder processing circuit. The encoder processing circuit identifies a weighting to favor one of the noise-like codebook and the pulse codebook over one other of the noise-like codebook and the pulse codebook.
The identification of the weighting by the encoder processing circuit may involve consideration of pitch correlation, an average to peak ratio, and/or a signal to noise ratio. It may also involve application of a classifier which may, for example, identify a noise-like unvoiced residual. The identification of the weighting by the encoder processing circuit may also comprise applying a first weighting approach if a noise-like residual is detected, and a second weighting approach if the noise-like residual is not detected. The first and second weighting approaches may also both utilize a plurality of parameters extracted from the speech signal.
Other aspects, advantages and novel features of the present invention will become apparent from the following detailed description of the invention when considered in conjunction with the accompanying drawings.
REFERENCES:
patent: 5233660 (1993-08-01), Chen
patent: 5293449 (1994-03-01), Tzeng
patent: 5307441 (1994-04-01), Tzeng
patent: 5323486 (1994-06-01), Taniguchi et al.
patent: 5396576 (1995-03-01), Miki et al.
patent: 5451951 (1995-09-01), Elliott et al.
patent: 5657418 (1997-08-01), Gerson et al.
patent: 5657420 (1997-08-01), Jacobs et al.
patent: 5734789 (1998-03-01), Swaminathan et al.
patent: 5778338 (1998-07-01), Jacobs et al.
patent: 5826226 (1998-10-01), Ozawa
patent: 5899968 (1999-05-01), Navarro et al.
patent: 0688013 (1995-12-01), None
patent: 0751496 (1997-01-01), None
W. Bastiaan Kleijn and Peter Kroon, “The RCELP Speech-Coding Algorithm,” vol. 5, No. 5, Sep.-Oct. 1994, pp. 39/573—47/581.
C. Laflamme, J-P. Adoul, H.Y. Su, and S. Morissette, “On Reducing Computational Complexity of Codebook Search in CELP Coder Through the Use of Algebraic Codes,” 1990, pp. 177-180.
Chih-Chung Kuo, Fu-Rong Jean, and Hsiao-Chuan Wang, “Speech Classification Embedded in Adaptive Codebook Search for Low Bit-Rate CELP Coding,” IEEE Transactions on Speech and Audio Processing, vol. 3, No. 1, Jan. 1995, pp. 1-5.
Erdal Paksoy, Alan McCree, and Vish Viswanathan, “A Variable-Rate Multimodal Speech Coder with Gain-Matched Analysis-By-Synthesis,” 1997, pp. 751-754.
Gerhard Schroeder, “International Telecommunication Union Telecommunications Standardization Sector,” Jun. 1995, pp. i-iv, 1-42.
“Digital Cellular Telecommunications System; Comfort Noise Aspects for Enhanced Full Rate (EFR) Speech Traffic Channels (GSM 06.62),” May 1996, pp. 1-16.
W. B. Kleijn and K.K. Paliwal (Editors), Speech Coding and Synthesis, Elsevier Science B.V. ; Kroon and W.B. Kleijn (Authors), Chapter 3: “Linear-Prediction Based on Analysis-by-Synthesis Coding”, 1995, pp. 81-113.
W. B. Kleijn and
Gao Yang
Su Huan-Yu
Banks-Harold Marsha D.
Brinks Hofer Gilson & Lione
Conexant Systems Inc.
Opsasnick Michael N.
LandOfFree
Speech classification and parameter weighting used in... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Speech classification and parameter weighting used in..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Speech classification and parameter weighting used in... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2986600