Speech coding system and method using voicing probability determ

Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission

Patent

Details

704206, 704219, 704262, 704268, G10L 702, G10L 914

Patent

active

057748373

ABSTRACT:
A modular system and method are provided for encoding and decoding speech signals using voicing probability determination. The continuous input speech is divided into time segments of a predetermined length. For each segment, the encoder computes the signal pitch and a parameter, expressed as a ratio Pv and defined as a voicing probability, that reflects the relative content of voiced and unvoiced portions in the spectrum of the signal. The voiced portion of the signal spectrum, as determined by the parameter Pv, is encoded using a set of harmonically related amplitudes corresponding to the estimated pitch. The unvoiced portion of the signal is processed in a separate branch that uses a modified linear predictive coding algorithm. Parameters representing both the voiced and the unvoiced portions of a speech segment are combined in data packets for transmission. In the decoder, speech is synthesized in reverse order from the transmitted parameters representing the voiced and unvoiced portions of the speech. Boundary conditions between voiced and unvoiced segments are established to ensure amplitude and phase continuity for improved output speech quality. A perceptually smooth transition between frames is ensured by using an overlap-and-add method of synthesis. Also disclosed is the use of the system to generate a variety of voice effects.
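
As an illustration of the decoder side described in the abstract, the following Python sketch synthesizes the voiced portion of each frame as a sum of pitch harmonics and joins the frames by overlap-add. It is not the patented implementation. The function names, the 8 kHz sampling rate, the triangular window with 50% overlap, and the reading of Pv as a voiced/unvoiced cutoff at Pv times half the sampling rate are all assumptions made for this example, and the unvoiced (linear predictive) branch is omitted.

```python
import math


def synth_voiced_frame(pitch_hz, amplitudes, pv, n_samples, fs=8000.0):
    """Sum cosine harmonics of pitch_hz for one frame.

    Assumption: harmonics above pv * fs / 2 belong to the unvoiced band
    and are skipped here. Each frame starts at zero phase, so this sketch
    does not enforce the phase continuity mentioned in the abstract.
    """
    cutoff_hz = pv * fs / 2.0
    frame = [0.0] * n_samples
    for k, amp in enumerate(amplitudes, start=1):
        freq = k * pitch_hz
        if freq > cutoff_hz:
            break  # remaining harmonics lie in the assumed unvoiced band
        omega = 2.0 * math.pi * freq / fs
        for n in range(n_samples):
            frame[n] += amp * math.cos(omega * n)
    return frame


def overlap_add(frames, hop):
    """Overlap-add frames of length 2 * hop with a triangular window.

    The window rises linearly from 0 to 1 and falls back, so the two
    windows covering any overlapped sample sum to exactly 1.
    """
    length = 2 * hop
    window = [1.0 - abs(n - hop) / hop for n in range(length)]
    out = [0.0] * (hop * (len(frames) + 1))
    for i, frame in enumerate(frames):
        if len(frame) != length:
            raise ValueError("each frame must contain 2 * hop samples")
        for n in range(length):
            out[i * hop + n] += window[n] * frame[n]
    return out


if __name__ == "__main__":
    # Three 20 ms frames at 8 kHz with a 100 Hz pitch and Pv = 0.5.
    frames = [synth_voiced_frame(100.0, [1.0, 0.5, 0.25], 0.5, 160)
              for _ in range(3)]
    speech = overlap_add(frames, hop=80)
    print(len(speech))  # 320 samples
```

Because the overlapping triangular windows sum to one, a parameter change from one frame to the next is cross-faded linearly over the overlap rather than applied as a step, which is the smoothing effect the overlap-and-add method is meant to provide.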

REFERENCES:
patent: 4374302 (1983-02-01), Vogten et al.
patent: 4392018 (1983-07-01), Fette
patent: 4433434 (1984-02-01), Mozer
patent: 4435831 (1984-03-01), Mozer
patent: 4435832 (1984-03-01), Asada et al.
patent: 4468804 (1984-08-01), Kates et al.
patent: 4771465 (1988-09-01), Bronson et al.
patent: 4797926 (1989-01-01), Bronson et al.
patent: 4802221 (1989-01-01), Jibbe
patent: 4856068 (1989-08-01), Quatieri, Jr. et al.
patent: 4864620 (1989-09-01), Bialick
patent: 4885790 (1989-12-01), McAulay et al.
patent: 4937873 (1990-06-01), McAulay et al.
patent: 4945565 (1990-07-01), Ozawa et al.
patent: 4991213 (1991-02-01), Wilson
patent: 5023910 (1991-06-01), Thomson
patent: 5054072 (1991-10-01), McAulay et al.
patent: 5081681 (1992-01-01), Hardwick et al.
patent: 5189701 (1993-02-01), Jain
patent: 5195166 (1993-03-01), Hardwick et al.
patent: 5216747 (1993-06-01), Hardwick et al.
patent: 5226084 (1993-07-01), Hardwick et al.
patent: 5226108 (1993-07-01), Hardwick et al.
patent: 5247579 (1993-09-01), Hardwick et al.
patent: 5267317 (1993-11-01), Kleijn
patent: 5303346 (1994-04-01), Fesseler et al.
patent: 5327518 (1994-07-01), George et al.
patent: 5327521 (1994-07-01), Savic et al.
patent: 5339164 (1994-08-01), Lim
patent: 5353373 (1994-10-01), Drogo de Iacovo et al.
patent: 5369724 (1994-11-01), Lim
patent: 5491772 (1996-02-01), Hardwick et al.
patent: 5517511 (1996-05-01), Hardwick et al.
Yeldener, Suat et al., "A High Quality 2.4 kb/s Multi-Band LPC Vocoder and its Real-Time Implementation". Center for Satellite Engineering Research, University of Surrey. pp. 14. Sep. 1992.
Yeldener, Suat et al., "Natural Sounding Speech Coder Operating at 2.4 Kb/s and Below", 1992 IEEE International Conference on Selected Topics in Wireless Communication, 25-26 Jun. 1992, Vancouver, BC, Canada, pp. 176-179.
Yeldener, Suat et al., "Low Bit Rate Speech Coding at 1.2 and 2.4 Kb/s", IEE Colloquium on Speech Coding--Techniques and Applications (Digest No. 090) pp. 611-614, Apr. 14, 1992. London, U.K.
Yeldener, Suat et al., "High Quality Multi-Band LPC Coding of Speech at 2.4 Kb/s", Electronics Letters, v.27, N14, Jul. 4, 1991, pp. 1287-1289.
Medan, Yoav, "Super Resolution Pitch Determination of Speech Signals". IEEE Transactions on Signal Processing, vol. 39, No. 1, Jan. 1991.
McAulay, Robert J. et al., "Computationally Efficient Sine-Wave Synthesis and its Application to Sinusoidal Transform Coding" M.I.T. Lincoln Laboratory, Lexington, MA. 1988 IEEE, S9.1 pp. 370-373.
Hardwick, John C., "A 4.8 KBPS Multi-Band Excitation Speech Coder". M.I.T. Research Laboratory of Electronics; 1988 IEEE, S9.2., pp. 374-377.
Thomson, David L., "Parametric Models of the Magnitude/Phase Spectrum for Harmonic Speech Coding". AT&T Bell Laboratories; 1988 IEEE, S9.3., pp. 378-381.
Marques, Jorge S. et al., "A Background for Sinusoid Based Representation of Voiced Speech". ICASSP 86, Tokyo, pp. 1233-1236.
Trancoso, Isabel M., et al., "A Study on the Relationships Between Stochastic and Harmonic Coding", INESC, ICASSP 86, Tokyo. pp. 1709-1712.
McAulay, Robert J. et al., "Phase Modelling and its Application to Sinusoidal Transform Coding". M.I.T. Lincoln Laboratory, Lexington, MA. 1986 IEEE, pp. 1713-1715.
McAulay, Robert J. et al., "Mid-Rate Coding Based on a Sinusoidal Representation of Speech". Lincoln Laboratory, Massachusetts Institute of Technology, Lexington, MA. 1985 IEEE, pp. 945-948.
Almeida, Luis B., "Variable-Frequency Synthesis: An Improved Harmonic Coding Scheme". 1984, IEEE, pp. 27.5.1-27.5.4.
McAulay, Robert J. et al., "Magnitude-Only Reconstruction Using A Sinusoidal Speech Model". M.I.T. Lincoln Laboratory, Lexington, MA. 1984 IEEE, pp. 27.6.1-27.6.4.
Nats Project; Eigensystem Subroutine Package (Eispack) F286-2 Hor. "A Fortran IV Subroutine to Determine the Eigenvalues of a Real Upper Hessenberg Matrix", Jul. 1975, pp. 330-337.
Daniel W. Griffin and Jae S. Lim, "Multiband Excitation Vocoder", IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 36, No. 8, pp. 1223-1235, Aug. 1988.
Masayuki Nishiguchi, Jun Matsumoto, Ryoji Wakatsuki, and Shinobu Ono, "Vector Quantized MBE with Simplified V/UV Division at 3.0 Kbps", Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP '93), vol. II, pp. 141-154, Apr. 1993.
