Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission
Reexamination Certificate
2000-09-15
2003-04-29
To, Doris H. (Department: 2654)
Data processing: speech signal processing, linguistics, language
Speech signal processing
For storage or transmission
C704S219000, C704S212000, C704S227000
Reexamination Certificate
active
06556966
ABSTRACT:
BACKGROUND OF THE INVENTION
1. Technical Field
This invention relates to speech communication systems and, more particularly, to systems and methods for digital speech coding.
2. Related Art
One prevalent mode of human communication involves the use of communication systems. Communication systems include both wireline and wireless radio systems. Wireless communication systems electrically connect with the landline systems and communicate using radio frequency (RF) with mobile communication devices. Currently, the radio frequencies available for communication in cellular systems, for example, are in the frequency range centered around 900 MHz and in the personal communication services (PCS) frequency range centered around 1900 MHz. Due to increased traffic caused by the expanding popularity of wireless communication devices, such as cellular telephones, it is desirable to reduce bandwidth of transmissions within the wireless systems.
Digital transmission in wireless radio telecommunications is increasingly being applied to both voice and data due to noise immunity, reliability, compactness of equipment and the ability to implement sophisticated signal processing functions using digital techniques. Digital transmission of speech signals involves the steps of: sampling an analog speech waveform with an analog-to-digital converter, speech compression (encoding), transmission, speech decompression (decoding), digital-to-analog conversion, and playback into an earpiece or a loudspeaker. The sampling of the analog speech waveform with the analog-to-digital converter creates a digital signal. However, the number of bits used in the digital signal to represent the analog speech waveform creates a relatively large bandwidth. For example, a speech signal that is sampled at a rate of 8000 Hz (once every 0.125 ms), where each sample is represented by 16 bits, will result in a bit rate of 128,000 (16×8000) bits per second, or 128 kbps (kilo bits per second).
Speech compression reduces the number of bits that represent the speech signal, thus reducing the bandwidth needed for transmission. However, speech compression may result in degradation of the quality of decompressed speech. In general, a higher bit rate will result in higher quality, while a lower bit rate will result in lower quality. However, speech compression techniques, such as coding techniques, can produce decompressed speech of relatively high quality at relatively low bit rates. In general, low bit rate coding techniques attempt to represent the perceptually important features of the speech signal, with or without preserving the actual speech waveform.
Typically, parts of the speech signal for which adequate perceptual representation is more difficult or more important (such as voiced speech, plosives or voice onsets) are coded and transmitted using a higher number of bits. Parts of the speech signal for which adequate perceptual representation is less difficult or less important (such as unvoiced, or the silence between words) are coded with a lower number of bits. The resulting average bit rate for the speech signal will be relatively lower than would be the case for a fixed bit rate that provides decompressed speech of similar quality.
These speech compression techniques have resulted in lowering the amount of bandwidth used to transmit a speech signal. However, further reduction in bandwidth is important in a communication system for a large number of users. Accordingly, there is a need for systems and methods of speech coding that are capable of minimizing the average bit rate needed for speech representation, while providing high quality decompressed speech.
SUMMARY
The invention provides a way to construct an efficient codebook structure and a fast search approach, which in one example are used in an SMV system. The SMV system varies the encoding and decoding rates in a communications device, such as a mobile telephone, a cellular telephone, a portable radio transceiver or other wireless or wire line communication device. The disclosed embodiments describe a system for varying the rates and associated bandwidth in accordance with an signal from an external source, such as the communication system with which the mobile device interacts. In various embodiments, the communications system selects a mode for the communications equipment using the system, and speech is processed according to that mode.
One embodiment of a speech compression system includes a full-rate codec, a half-rate codec, a quarter-rate codec and an eighth-rate codec each capable of encoding and decoding speech signals. The speech compression system performs a rate selection on a frame by frame basis of a speech signal to select one of the codecs. The speech compression system then utilizes a fixed codebook structure with a plurality of subcodebooks. A search routine selects a best codevector from among the codebooks in encoding and decoding the speech. The search routine is based on minimizing an error function in an iterative fashion.
Accordingly, the speech coder is capable of selectively activating the codecs to maximize the overall quality of a reconstructed speech signal while maintaining the desired average bit rate. Other systems, methods, features and advantages of the invention will be or will become apparent to one with skill in the art upon examination of the following figures and detailed description. It is intended that all such additional systems, methods, features and advantages included within this description be within the scope of the invention, and be protected by the accompanying claims.
REFERENCES:
patent: 4868867 (1989-09-01), Davidson et al.
patent: 5263088 (1993-11-01), Hazu et al.
patent: 5323486 (1994-06-01), Taniguchi et al.
patent: 5602962 (1997-02-01), Kellermann
patent: 5701392 (1997-12-01), Adoul et al.
patent: 5717825 (1998-02-01), Lamblin
patent: 5924062 (1999-07-01), Maung
patent: 5970444 (1999-10-01), Hayashi et al.
patent: 6041297 (2000-03-01), Goldberg
patent: 6097751 (2000-08-01), Relph
patent: 6173257 (2001-01-01), Gao
patent: 6393390 (2002-05-01), Patel et al.
patent: 0516439 (1992-12-01), None
patent: 0577488 (1994-01-01), None
patent: 0596847 (1994-05-01), None
patent: 0751496 (1997-01-01), None
Kataoka et al (“Improved CS-CELP Speech Coding in a Noisy Environment Using a Trained Sparse Conjugate Codebook” International Conference on Acoustics, Speech, and Signal Processing, May 1995).*
Database Inspec Online! Institute of Electrical Engineers, Stevenage, GB Kim et al.: “Complexity reduction methods for vector sum excited linear prediction coding” Database accession No. 5027941 XP002126377 & Proceedings of 1994 International Conference on Spoken Language Processing (ICSLP '94), vol. 4, Sep. 18-22, 1994, pp. 2071-2074 Yokohama, JP.
Berouti M et al: “Efficient computation and encoding of the multipulse excitation for LPC” International Conference on Acoustics, Speech & Signal Processing, ICASSP. San Diego, Mar. 19-21, 1984, New York, IEEE, US, vol. 1 Conf. 9, Mar. 19, 1984, pp. 10101-10104, XP 002083781 paragraph '02.1! paragraph '05.1!.
Salami R A et al: “Performance of Error Protected Binary Pulse Excitation Corders at 11.4 KB/S Over Mobile Radio Channels” Speech Processing 1. Albuquerque, Apr. 3-6, 1990, International Conference on Acoustics, Speech & Signal Processing, ICASSP, New York, IEEE, US, vol. 1 Conf. 15, Apr. 3, 1990, pp. 473-476, XP000146508 paragraph '0002!.
A. Kataoka, S. Hosaka, J. Ikedo, T. Moriya & S. Hayashi, “Improved CS-CELP Speech Coding in a Noisy Environment using a Trained Sparse Conjugate Codebook”, 1995 International Conference on Acoustics, Speech & Signal Processing, May 1995.*
A. Chmielewski, J. Domaszewicz, J. Milek, “Real Time Implementation of Forward Gain-Adaptive Vector Quantizer,” 8th European Conference Proceedings on Electrotechnics, 1988 & Conference Proceedings on Area Communication, EUROCON '88, Jun. 1988.*
Sridha Sridhan & John Leis, “Two Novel Lossless Algorithms to Exploit Index Redundancy in VQ Speech Compression,”
Conexant Systems Inc.
Farjami & Farjami LLP
Nolan Daniel A.
To Doris H.
LandOfFree
Codebook structure for changeable pulse multimode speech coding does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Codebook structure for changeable pulse multimode speech coding, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Codebook structure for changeable pulse multimode speech coding will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3092446