Multimode speech encoder and decoder apparatuses

Data processing: speech signal processing – linguistics – language – Speech signal processing – Synthesis

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C704S201000, C704S500000, C704S203000, C704S221000, C704S229000, C704S230000

Reexamination Certificate

active

06334105

ABSTRACT:

TECHNICAL FIELD
The present invention relates to a low-bit-rate speech coding apparatus which performs coding on a speech signal to transmit, for example, in a mobile communication system, and more particularly, to a CELP (Code Excited Linear Prediction) type speech coding apparatus which separates the speech signal to vocal tract information and excitation information to represent.
BACKGROUND ART
Used in the fields of digital mobile communications and speech storage are speech coding apparatuses which compress speech information to encode with high efficiency for utilization of radio signals and recording media. Among them, the system based on a CELP (Code Excited Linear Prediction) system is carried into practice widely for the apparatuses operating at medium to low bit rates. The technology of the CELP is described in “Code-excited Linear Prediction (CELP):High-quality Speech at Very Low Bit Rates” by M. R. Schroeder and B. S. Atal, Proc. ICASSP-85, 24.1.1, pp.937-940, 1985.
In the CELP type speech coding system, speech signals are divided into predetermined frame lengths (about 5 ms to 50 ms), linear prediction of the speech signals is performed for each frame, the prediction residual (excitation vector signal) obtained by the linear prediction for each frame is encoded using an adaptive code vector and random code vector comprised of known waveforms. The adaptive code vector and random code vector are selected for use respectively from an adaptive codebook storing previously generated excitation vectors and a random codebook storing the predetermined number of pre-prepared vectors with predetermined shapes. Used as the random code vectors stored in the random codebook are, for example, random noise sequence vectors and vectors generated by arranging a few pulses at different positions.
The CELP coding apparatus performs the LPC synthesis and quantization, pitch search, random codebook search, and gain codebook search using input digital signals, and transmits the quantized LPC (L), pitch period (P), a random codebook index (S) and a gain codebook index (G) to a decoder.
However, the above-mentioned conventional speech coding apparatus needs to cope with voiced speeches, unvoiced speeches and background noises using a single type of random codebook, and therefore it is difficult to encode all the input signals with high quality.
DISCLOSURE OF INVENTION
An object of the present invention is to provide a multimode speech coding apparatus and speech decoding apparatus capable of providing excitation coding with multimode without newly transmitting mode information, in particular, performing judgment of speech region
on-speech region in addition to judgment of voiced region/unvoiced region, and further increasing the improvement of coding/decoding performance performed with the multimode.
In the present invention, the mode determination is performed using static/dynamic characteristics of a quantized parameter representing spectral characteristics, modes of various codebooks for use in coding excitation vectors are switched based on the mode determination indicating the speech region
on-speech region or voiced region/unvoiced region. Further, in the present invention, the modes of various codebooks for use in decoding are switched using the mode information used in the coding in decoding.


REFERENCES:
patent: 5012519 (1991-04-01), Adlersberg
patent: 5224167 (1993-06-01), Taniguchi et al.
patent: 5414796 (1995-05-01), Jacobs et al.
patent: 5490130 (1996-02-01), Akagiri
patent: 5596676 (1997-01-01), Swaminathan
patent: 5706394 (1998-01-01), Wynn
patent: 5729655 (1998-03-01), Kolesnik et al.
patent: 5978762 (1999-11-01), Smyth et al.
patent: 6055619 (2000-04-01), North et al.
patent: 2290201 (1995-12-01), None
patent: 6118993 (1994-04-01), None
patent: 10143195 (1998-05-01), None
PCT International Search Report dated Nov. 30, 1999.
T. Morii et al., “Multi-Mode CELP Codec using Short-Term Characteristics of Speech,” Technical Report of IEICE, SP 95-80 (1995-11), pp. 55-62 (with abstract in English).
M. Oshikiri et al., “A Speech/Silence Segmentation Method using Spectral Variation and the Application to a Variable Rate Speech Codec,” Proceedings of the 1997 Spring Meeting of the Acoustical Society of Japan (1998), pp. 281-282 (with comments in English by the Applicant).
O. Mizuno et al., “Speech Discrimination using Dynamic and Static Spectral Features,” pp. 107-108 (with comments in English by the Applicant).
H. Tasaki et al., “Post Noise Smoother to Improve Low Bit Rate Speech Coding Performance under Background Noise Conditions,” pp. 237-238 ( with comments in English by the Applicant).
M. Oshikiri et al., “A 2.4 kbps Variable Bit Rate ADP-CELP Speech Coder,” p. 1492 (with comments in English by the Applicant.).
T. Yamaura et al., “Improving Excitation Coding in Pitch Position Synchronized CELP,” pp. 239-240 (with comments in English by the Applicant).
Abstract and partial claims from Japanese publication of U.S. Patent No. 5,596,678, Jan. 21, 1997, Karl T. Wigren et al. (three pages in English).
M. R. Schroeder et al., “Code-excited Linear Prediction (CELP): High-quality Speech at Very Low Bit Rates,” Proc. ICASSP-85, 24.1.1., pp. 937-940, 1985.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Multimode speech encoder and decoder apparatuses does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Multimode speech encoder and decoder apparatuses, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Multimode speech encoder and decoder apparatuses will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2560976

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.