Low bit-rate speech coding system and method using voicing proba

Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission

Patent

Details

704/206, 704/219, 704/223, 704/262, G10L 9/14, G10L 7/02

active

058901086

ABSTRACT:
A modular system and method are provided for low bit-rate encoding and decoding of speech signals using voicing probability determination. The continuous input speech is divided into time segments of a predetermined length. For each segment, the encoder computes a model signal and subtracts it from the original signal in the segment to obtain a residual excitation signal. From the excitation signal, the system computes the signal pitch and a parameter Pv, defined as a voicing probability, which is expressed as a ratio and is related to the relative content of voiced and unvoiced portions in the spectrum of the excitation signal. The voiced and unvoiced portions of the excitation spectrum, as determined by Pv, are encoded using one or more parameters related to the energy of the excitation signal in a predetermined set of frequency bands. In the decoder, speech is synthesized in reverse order from the transmitted parameters representing the model speech, the signal pitch, the voicing probability, and the excitation levels. Boundary conditions between voiced and unvoiced segments are established to ensure amplitude and phase continuity for improved output speech quality. A perceptually smooth transition between frames is ensured by an overlap-and-add method of synthesis. LPC interpolation and post-filtering are used to obtain output speech with improved perceptual quality.
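The first encoder step described in the abstract (split the input into fixed-length segments, compute a model signal, subtract it to get a residual excitation) can be illustrated with a minimal sketch. It assumes the model signal is a linear-prediction (LPC) estimate obtained with the autocorrelation method and the Levinson-Durbin recursion; the abstract names LPC only in connection with interpolation, so this choice is an assumption, as are the function names `split_frames`, `lpc`, and `residual`, the frame length, and the predictor order. Pitch estimation and the computation of Pv are not shown.

```python
def split_frames(signal, frame_len):
    """Divide the signal into consecutive, non-overlapping frames of frame_len samples."""
    return [signal[i:i + frame_len]
            for i in range(0, len(signal) - frame_len + 1, frame_len)]


def autocorr(frame, max_lag):
    """Autocorrelation of one frame for lags 0..max_lag."""
    n = len(frame)
    return [sum(frame[i] * frame[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]


def lpc(frame, order):
    """Levinson-Durbin recursion (assumed model, not confirmed by the source).

    Returns coefficients a[1..order] such that s[n] is predicted as
    sum(a[k] * s[n - k] for k in 1..order).
    """
    r = autocorr(frame, order)
    a = [0.0] * (order + 1)
    err = r[0]
    if err == 0.0:           # silent frame: nothing to predict
        return a[1:]
    for i in range(1, order + 1):
        k = (r[i] - sum(a[j] * r[i - j] for j in range(1, i))) / err
        new_a = a[:]
        new_a[i] = k
        for j in range(1, i):
            new_a[j] = a[j] - k * a[i - j]
        a = new_a
        err *= (1.0 - k * k)  # prediction error energy after order i
        if err <= 0.0:
            break
    return a[1:]


def residual(frame, coeffs):
    """Subtract the model (predicted) signal from the frame to get the excitation."""
    out = []
    for n in range(len(frame)):
        predicted = sum(c * frame[n - k]
                        for k, c in enumerate(coeffs, start=1) if n - k >= 0)
        out.append(frame[n] - predicted)
    return out


if __name__ == "__main__":
    # Toy input: a decaying exponential, which a first-order predictor models well.
    speech = [0.9 ** n for n in range(400)]
    for frame in split_frames(speech, 200):
        coeffs = lpc(frame, order=2)
        excitation = residual(frame, coeffs)
        print(coeffs, sum(e * e for e in excitation))
```

In a coder along the lines of the abstract, the pitch and Pv would then be estimated from `excitation` rather than from the original frame.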

REFERENCES:
patent: 4374302 (1983-02-01), Vogten et al.
patent: 4392018 (1983-07-01), Fette
patent: 4433434 (1984-02-01), Mozer
patent: 4435831 (1984-03-01), Mozer
patent: 4435832 (1984-03-01), Asada et al.
patent: 4468804 (1984-08-01), Kates et al.
patent: 4771465 (1988-09-01), Bronson et al.
patent: 4797926 (1989-01-01), Bronson et al.
patent: 4802221 (1989-01-01), Jibbe
patent: 4856068 (1989-08-01), Quatieri, Jr. et al.
patent: 4864620 (1989-09-01), Bialick
patent: 4885790 (1989-12-01), McAulay et al.
patent: 4937873 (1990-06-01), McAulay et al.
patent: 4945565 (1990-07-01), Ozawa et al.
patent: 4991213 (1991-02-01), Wilson
patent: 5023910 (1991-06-01), Thomson
patent: 5054072 (1991-10-01), McAulay et al.
patent: 5081681 (1992-01-01), Hardwick et al.
patent: 5189701 (1993-02-01), Jain
patent: 5195166 (1993-03-01), Hardwick et al.
patent: 5216747 (1993-06-01), Hardwick et al.
patent: 5226084 (1993-07-01), Hardwick et al.
patent: 5226108 (1993-07-01), Hardwick et al.
patent: 5247579 (1993-09-01), Hardwick et al.
patent: 5267317 (1993-11-01), Kleijn
patent: 5303346 (1994-04-01), Fesseler et al.
patent: 5327518 (1994-07-01), George et al.
patent: 5327521 (1994-07-01), Savic et al.
patent: 5339164 (1994-08-01), Lim
patent: 5353373 (1994-10-01), Drogo de Iacovo et al.
patent: 5369724 (1994-11-01), Lim
patent: 5491772 (1996-02-01), Hardwick et al.
patent: 5517511 (1996-05-01), Hardwick et al.
patent: 5630012 (1997-05-01), Nishiguchi et al.
patent: 5717821 (1998-02-01), Tsutsui et al.
patent: 5765126 (1998-06-01), Tsutsui et al.
Daniel Wayne Griffin and Jae S. Lim, "Multiband Excitation Vocoder," IEEE Trans. on Acoustics, Speech, and Signal Processing, vol. 36, No. 8, pp. 1223-1235, Aug. 1988.
Masayuki Nishiguchi, Jun Matsumoto, Ryoji Wakatsuki, and Shinobu Ono, "Vector Quantized MBE With Simplified V/UV Division at 3.0 Kbps", Proc. IEEE ICASSP '93, vol. II, pp. 151-154, Apr. 1993.
Yeldener, Suat et al., "A High Quality 2.4 Kb/s Multi-Band LPC Vocoder and its Real-Time Implementation". Center for Satellite Engineering Research, University of Surrey. pp. 1-4. Sep. 1992.
Yeldener, Suat et al., "Natural Sounding Speech Coder Operating at 2.4 Kb/s and Below", 1992 IEEE International Conference on Selected Topics in Wireless Communication, 25-26 Jun. 1992, Vancouver, BC, Canada, pp. 176-179.
Yeldener, Suat et al., "Low Bit Rate Speech Coding at 1.2 and 2.4 Kb/s", IEE Colloquium on Speech Coding--Techniques and Applications (Digest No. 090) pp. 611-614, Apr. 14, 1992. London, U.K.
Yeldener, Suat et al., "High Quality Multi-Band LPC Coding of Speech at 2.4 Kb/s", Electronics Letters, v.27, N14, Jul. 4, 1991, pp. 1287-1289.
Medan, Yoav, et al., "Super Resolution Pitch Determination of Speech Signals". IEEE Transactions on Signal Processing, vol. 39, No. 1, Jan. 1991.
McAulay, Robert J. et al., "Computationally Efficient Sine-Wave Synthesis and its Application to Sinusoidal Transform Coding" M.I.T. Lincoln Laboratory, Lexington, MA. 1988 IEEE, S9.1 pp. 370-373.
Hardwick, John C., "A 4.8 KBPS Multi-Band Excitation Speech Coder". M.I.T. Research Laboratory of Electronics; 1988 IEEE, S9.2., pp. 374-377.
Thomson, David L., "Parametric Models of the Magnitude/Phase Spectrum for Harmonic Speech Coding". AT&T Bell Laboratories; 1988 IEEE, S9.3., pp. 378-381.
Marques, Jorge S. et al., "A Background for Sinusoid Based Representation of Voiced Speech". ICASSP 86, Tokyo, pp. 1233-1236.
Trancoso, Isabel M., et al., "A Study on the Relationships Between Stochastic and Harmonic Coding". INESC, ICASSP 86, Tokyo. pp. 1709-1712.
McAulay, Robert J. et al., "Phase Modelling and its Application to Sinusoidal Transform Coding". M.I.T. Lincoln Laboratory, Lexington, MA. 1986 IEEE, pp. 1713-1715.
McAulay, Robert J. et al., "Mid-Rate Coding Based on a Sinusoidal Representation of Speech". Lincoln Laboratory, Massachusetts Institute of Technology, Lexington, MA. 1985 IEEE, pp. 945-948.
Almeida, Luis B., "Variable-Frequency Synthesis: An Improved Harmonic Coding Scheme". 1984, IEEE, pp. 27.5.1-27.5.4.
McAulay, Robert J. et al., "Magnitude-Only Reconstruction Using A Sinusoidal Speech Model", M.I.T. Lincoln Laboratory, Lexington, MA. 1984 IEEE, pp. 27.6.1-27.6.4.
Nats Project; Eigensystem Subroutine Package (EISPACK) F286-2 HQR. "A Fortran IV Subroutine to Determine the Eigenvalues of a Real Upper Hessenberg Matrix", Jul. 1975, pp. 330-337.
