Codebook structure for changeable pulse multimode speech coding

Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission

Reexamination Certificate

Rate now

[ 0.00 ] – not rated yet Voters 0 Comments 0

Details Codebook structure for changeable pulse multimode speech coding Codebook structure for changeable pulse multimode speech coding

: 2000-09-15
: 2003-04-29
: To, Doris H. (Department: 2654)
: Data processing: speech signal processing, linguistics, language
: Speech signal processing
: For storage or transmission

: C704S219000, C704S212000, C704S227000
: Reexamination Certificate
: active
: 06556966
: ABSTRACT:

BACKGROUND OF THE INVENTION
1. Technical Field
This invention relates to speech communication systems and, more particularly, to systems and methods for digital speech coding.
2. Related Art
One prevalent mode of human communication involves the use of communication systems. Communication systems include both wireline and wireless radio systems. Wireless communication systems electrically connect with the landline systems and communicate using radio frequency (RF) with mobile communication devices. Currently, the radio frequencies available for communication in cellular systems, for example, are in the frequency range centered around 900 MHz and in the personal communication services (PCS) frequency range centered around 1900 MHz. Due to increased traffic caused by the expanding popularity of wireless communication devices, such as cellular telephones, it is desirable to reduce bandwidth of transmissions within the wireless systems.
Digital transmission in wireless radio telecommunications is increasingly being applied to both voice and data due to noise immunity, reliability, compactness of equipment and the ability to implement sophisticated signal processing functions using digital techniques. Digital transmission of speech signals involves the steps of: sampling an analog speech waveform with an analog-to-digital converter, speech compression (encoding), transmission, speech decompression (decoding), digital-to-analog conversion, and playback into an earpiece or a loudspeaker. The sampling of the analog speech waveform with the analog-to-digital converter creates a digital signal. However, the number of bits used in the digital signal to represent the analog speech waveform creates a relatively large bandwidth. For example, a speech signal that is sampled at a rate of 8000 Hz (once every 0.125 ms), where each sample is represented by 16 bits, will result in a bit rate of 128,000 (16×8000) bits per second, or 128 kbps (kilo bits per second).
Speech compression reduces the number of bits that represent the speech signal, thus reducing the bandwidth needed for transmission. However, speech compression may result in degradation of the quality of decompressed speech. In general, a higher bit rate will result in higher quality, while a lower bit rate will result in lower quality. However, speech compression techniques, such as coding techniques, can produce decompressed speech of relatively high quality at relatively low bit rates. In general, low bit rate coding techniques attempt to represent the perceptually important features of the speech signal, with or without preserving the actual speech waveform.
Typically, parts of the speech signal for which adequate perceptual representation is more difficult or more important (such as voiced speech, plosives or voice onsets) are coded and transmitted using a higher number of bits. Parts of the speech signal for which adequate perceptual representation is less difficult or less important (such as unvoiced, or the silence between words) are coded with a lower number of bits. The resulting average bit rate for the speech signal will be relatively lower than would be the case for a fixed bit rate that provides decompressed speech of similar quality.
These speech compression techniques have resulted in lowering the amount of bandwidth used to transmit a speech signal. However, further reduction in bandwidth is important in a communication system for a large number of users. Accordingly, there is a need for systems and methods of speech coding that are capable of minimizing the average bit rate needed for speech representation, while providing high quality decompressed speech.
SUMMARY
The invention provides a way to construct an efficient codebook structure and a fast search approach, which in one example are used in an SMV system. The SMV system varies the encoding and decoding rates in a communications device, such as a mobile telephone, a cellular telephone, a portable radio transceiver or other wireless or wire line communication device. The disclosed embodiments describe a system for varying the rates and associated bandwidth in accordance with an signal from an external source, such as the communication system with which the mobile device interacts. In various embodiments, the communications system selects a mode for the communications equipment using the system, and speech is processed according to that mode.
One embodiment of a speech compression system includes a full-rate codec, a half-rate codec, a quarter-rate codec and an eighth-rate codec each capable of encoding and decoding speech signals. The speech compression system performs a rate selection on a frame by frame basis of a speech signal to select one of the codecs. The speech compression system then utilizes a fixed codebook structure with a plurality of subcodebooks. A search routine selects a best codevector from among the codebooks in encoding and decoding the speech. The search routine is based on minimizing an error function in an iterative fashion.
Accordingly, the speech coder is capable of selectively activating the codecs to maximize the overall quality of a reconstructed speech signal while maintaining the desired average bit rate. Other systems, methods, features and advantages of the invention will be or will become apparent to one with skill in the art upon examination of the following figures and detailed description. It is intended that all such additional systems, methods, features and advantages included within this description be within the scope of the invention, and be protected by the accompanying claims.

REFERENCES:
patent: 4868867 (1989-09-01), Davidson et al.
patent: 5263088 (1993-11-01), Hazu et al.
patent: 5323486 (1994-06-01), Taniguchi et al.
patent: 5602962 (1997-02-01), Kellermann
patent: 5701392 (1997-12-01), Adoul et al.
patent: 5717825 (1998-02-01), Lamblin
patent: 5924062 (1999-07-01), Maung
patent: 5970444 (1999-10-01), Hayashi et al.
patent: 6041297 (2000-03-01), Goldberg
patent: 6097751 (2000-08-01), Relph
patent: 6173257 (2001-01-01), Gao
patent: 6393390 (2002-05-01), Patel et al.
patent: 0516439 (1992-12-01), None
patent: 0577488 (1994-01-01), None
patent: 0596847 (1994-05-01), None
patent: 0751496 (1997-01-01), None
Kataoka et al (“Improved CS-CELP Speech Coding in a Noisy Environment Using a Trained Sparse Conjugate Codebook” International Conference on Acoustics, Speech, and Signal Processing, May 1995).*
Database Inspec Online! Institute of Electrical Engineers, Stevenage, GB Kim et al.: “Complexity reduction methods for vector sum excited linear prediction coding” Database accession No. 5027941 XP002126377 & Proceedings of 1994 International Conference on Spoken Language Processing (ICSLP '94), vol. 4, Sep. 18-22, 1994, pp. 2071-2074 Yokohama, JP.
Berouti M et al: “Efficient computation and encoding of the multipulse excitation for LPC” International Conference on Acoustics, Speech & Signal Processing, ICASSP. San Diego, Mar. 19-21, 1984, New York, IEEE, US, vol. 1 Conf. 9, Mar. 19, 1984, pp. 10101-10104, XP 002083781 paragraph '02.1! paragraph '05.1!.
Salami R A et al: “Performance of Error Protected Binary Pulse Excitation Corders at 11.4 KB/S Over Mobile Radio Channels” Speech Processing 1. Albuquerque, Apr. 3-6, 1990, International Conference on Acoustics, Speech & Signal Processing, ICASSP, New York, IEEE, US, vol. 1 Conf. 15, Apr. 3, 1990, pp. 473-476, XP000146508 paragraph '0002!.
A. Kataoka, S. Hosaka, J. Ikedo, T. Moriya & S. Hayashi, “Improved CS-CELP Speech Coding in a Noisy Environment using a Trained Sparse Conjugate Codebook”, 1995 International Conference on Acoustics, Speech & Signal Processing, May 1995.*
A. Chmielewski, J. Domaszewicz, J. Milek, “Real Time Implementation of Forward Gain-Adaptive Vector Quantizer,” 8th European Conference Proceedings on Electrotechnics, 1988 & Conference Proceedings on Area Communication, EUROCON '88, Jun. 1988.*
Sridha Sridhan & John Leis, “Two Novel Lossless Algorithms to Exploit Index Redundancy in VQ Speech Compression,”

Affiliated with

Gao Yang

Inventor

[ 0.00 ] – not rated yet Voters 0 Comments 0

Also associated with

Conexant Systems Inc.

Corporate Assignee

[ 0.00 ] – not rated yet Voters 0 Comments 0

Farjami & Farjami LLP

Law Firm

[ 0.00 ] – not rated yet Voters 0 Comments 0

Nolan Daniel A.

Examiner

[ 0.00 ] – not rated yet Voters 0 Comments 0

To Doris H.

Examiner

[ 0.00 ] – not rated yet Voters 0 Comments 0

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Codebook structure for changeable pulse multimode speech coding does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Codebook structure for changeable pulse multimode speech coding, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Codebook structure for changeable pulse multimode speech coding will most certainly appreciate the feedback.

Rate now

Comments { 0 }

Profile ID: LFUS-PAI-O-3092446

All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.

Canada

Charities
Companies
MP Candidates
Patents
Employee Salary Disclosure

World

Places of the World
Scientific Papers

United States

Banks
Companies
Counties
Patents
Employee Salary Disclosure