Speech encoder using voice activity detection in coding noise

Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C704S219000, C704S201000

Reexamination Certificate

active

06823303

ABSTRACT:

BACKGROUND
1. Technical Field
The present invention relates generally to speech encoding and decoding in voice communication systems; and, more particularly, it relates to various techniques used with code-excited linear prediction coding to obtain high quality speech reproduction through a limited bit rate communication channel.
2. Related Art
Signal modeling and parameter estimation play significant roles in communicating voice information with limited bandwidth constraints. To model basic speech sounds, speech signals are sampled as a discrete waveform to be digitally processed. In one type of signal coding technique called LPC (linear predictive coding), the signal value at any particular time index is modeled as a linear function of previous values. A subsequent signal is thus linearly predictable according to an earlier value. As a result, efficient signal representations can be determined by estimating and applying certain prediction parameters to represent the signal.
Applying LPC techniques, a conventional source encoder operates on speech signals to extract modeling and parameter information for communication to a conventional source decoder via a communication channel. Once received, the decoder attempts to reconstruct a counterpart signal for playback that sounds to a human ear like the original speech.
A certain amount of communication channel bandwidth is required to communicate the modeling and parameter information to the decoder. In embodiments, for example where the channel bandwidth is shared and real-time reconstruction is necessary, a reduction in the required bandwidth proves beneficial. However, using conventional modeling techniques, the quality requirements in the reproduced speech limit the reduction of such bandwidth below certain levels.
Speech signals contain a significant amount of noise content. Traditional methods of coding noise often have difficulty in properly modeling noise which results in undesirable interruptions, discontinuities, and during conversation. Analysis by synthesis speech coders such as conventional code-excited linear predictive coders are unable to appropriately code background noise, especially at reduced bit rates. A different and better method of coding the background noise is desirable for good quality representation of background noise.
Further limitations and disadvantages of conventional systems will become apparent to one of skill in the art after reviewing the remainder of the present application with reference to the drawings.
SUMMARY OF THE INVENTION
Various aspects of the present invention can be found in a speech encoding system using an analysis by synthesis coding approach on a speech signal. The encoder processing circuit identifies a speech parameter of the speech signal using a speech signal analyzer. The speech signal analyzer may be used to identify multiple speech parameters of the speech signal. Upon processing these speech parameters, the speech encoder system classifies the speech signal as having either active or inactive voice content. Upon classification of the speech signal as having voice active content, a first coding scheme is employed for representing the speech signal. This coding information may be later used to reproduce the speech signal using a speech decoding system.
In certain embodiments of the invention, a weighted filter may filter the speech signal to assist in the identification of the speech parameters. The speech encoding system processes the identified speech parameters to determine the voice content of the speech signal. If voice content is identified, code-excited linear prediction is used to code the speech signal in one embodiment of the invention. If the speech signal is identified as voice inactive, then a random excitation sequence is used for coding of the speech signal. Additionally for voice inactive signals, an energy level and a spectral information are used to code the speech signal. The random excitation sequence may be generated in a speech decoding system of the invention. The random excitation sequence may alternatively be generated at the encoding end of the invention or be stored in a codebook. If desired, the manner by which the random excitation sequence was generated may be transmitted to the speech decoding system. However, in other embodiments of the invention the manner by which the random excitation sequence was generated may be omitted.
Other aspects, advantages and novel features of the present invention will become apparent from the following detailed description of the invention when considered in conjunction with the accompanying drawings.


REFERENCES:
patent: 5233660 (1993-08-01), Chen
patent: 5293449 (1994-03-01), Tzeng
patent: 5307441 (1994-04-01), Tzeng
patent: 5323486 (1994-06-01), Taniguchi et al.
patent: 5396576 (1995-03-01), Miki et al.
patent: 5451951 (1995-09-01), Elliott et al.
patent: 5657420 (1997-08-01), Jacobs et al.
patent: 5734789 (1998-03-01), Swaminathan et al.
patent: 5778338 (1998-07-01), Jacobs et al.
patent: 5826226 (1998-10-01), Ozawa
patent: 5899968 (1999-05-01), Navarro et al.
C.B. Southcott, D. Freeman, G. Cosier, D. Sereno, A. Van der Krogt, A. Gilloire, and H.J. Braun, “Voice Control of the Pan-European Digital Mobile Radio System,”Proceedings of the Global Telecommunications Conference and Exhibition(Globecom), US, New York, IEEE 1989, pp. 1070-1074.
Erdal Paksoy, Krishnaswamy Srinivasan, and Allen Gersho, “Variable Bit-Rate CELP Coding of Speech with Phonetic Classification,”European Transactions on Telecommunications and Related Technologies, IT, AEI, Milano,vol. 5, Sep.-Oct. 1994, pp. 57/591-67-601.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Speech encoder using voice activity detection in coding noise does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Speech encoder using voice activity detection in coding noise, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Speech encoder using voice activity detection in coding noise will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3359293

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.