Synthesis of MBE-based coded speech using regenerated phase info

Patent

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

395 214, 395 217, 395 232, 395 273, 395 275, G10L 702

Patent

active

057013900

ABSTRACT:
A method for decoding and synthesizing a synthetic digital speech signal from digital bits of the type produced by dividing a speech signal into frames and encoding the speech signal by an MBE based encoder. The method includes the steps of decoding the bits to provide spectral envelope and voicing information for each of the frames, processing the spectral envelope information to determine regenerated spectral phase information for each of the frames based on local envelope smoothness determining from the voicing information whether frequency bands for a particular frame are voiced or unvoiced. The method further includes synthesizing speech components for voiced frequency bands using the regenerated spectral phase information, synthesizing a speech component representing the speech signal in at least one unvoiced frequency band, and synthesizing the speech signal by combining the synthesized speech components for voiced and unvoiced frequency bands.

REFERENCES:
patent: 3706929 (1972-12-01), Robinson et al.
patent: 3975587 (1976-08-01), Dunn et al.
patent: 3982070 (1976-09-01), Flanagan
patent: 3995116 (1976-11-01), Flanagan
patent: 4004096 (1977-01-01), Bauer et al.
patent: 4015088 (1977-03-01), Dubnowski et al.
patent: 4074228 (1978-02-01), Jonscher
patent: 4076958 (1978-02-01), Fulghum
patent: 4091237 (1978-05-01), Wolnowsky et al.
patent: 4441200 (1984-04-01), Fette et al.
patent: 4618982 (1986-10-01), Horvath et al.
patent: 4622680 (1986-11-01), Zinser
patent: 4672669 (1987-06-01), Des Blache et al.
patent: 4696038 (1987-09-01), Doddington et al.
patent: 4720861 (1988-01-01), Bertrand
patent: 4797926 (1989-01-01), Bronson et al.
patent: 4799059 (1989-01-01), Grindahl et al.
patent: 4809334 (1989-02-01), Bhaskar
patent: 4813075 (1989-03-01), Ney
patent: 4879748 (1989-11-01), Picone et al.
patent: 4885790 (1989-12-01), McAulay et al.
patent: 4989247 (1991-01-01), Van Hemert
patent: 5023910 (1991-06-01), Thomson
patent: 5036515 (1991-07-01), Freeburg
patent: 5054072 (1991-10-01), McAulay et al.
patent: 5067158 (1991-11-01), Arjmand
patent: 5081681 (1992-01-01), Hardwick
patent: 5091944 (1992-01-01), Takahashi
patent: 5095392 (1992-03-01), Shimazaki et al.
patent: 5179626 (1993-01-01), Thomson
patent: 5195166 (1993-03-01), Hardwick et al.
patent: 5216747 (1993-06-01), Hardwick et al.
patent: 5226084 (1993-07-01), Hardwick et al.
patent: 5226108 (1993-07-01), Hardwick et al.
patent: 5247579 (1993-09-01), Hardwick et al.
patent: 5265167 (1993-11-01), Akamine et al.
patent: 5517511 (1996-05-01), Hardwick et al.
Cox et al., "Subband Speech Coding and Matched Convolutional Channel Coding for Mobile Radio Channels," IEEE Trans. Signal Proc., vol. 39, No. 8 (Aug. 1991), pp. 1717-1731.
Digital Voice Systems, Inc., "The DVSI IMBE Speech Compression System," advertising brochure (May 12, 1993).
Digital Voice Systems, Inc., "The DVSI IMBE Speech Coder," advertising brochure (May 12, 1993).
Fujimura, "An Approximation to Voice Aperiodicity", IEEE Transactions on Audio and Electroacoutics, vol. AU-16, No. 1 (Mar. 1968), pp. 68-72.
Griffin, "The Multiband Excitation Vocoder", Ph.D. Thesis, M.I.T., 1987.
Hardwick et al., "The Application of the IMBE Speech Coder to Mobile Communications," IEEE (1991), pp. 249-252 ICASSP 91 May 1991.
Heron, "A 32-Band Sub-band/Transform Coder Incorporating Vector Quantization for Dynamic Bit Allocation", IEEE (1983), pp. 1276-1279.
Makhoul, "A Mixed-Source Model for Speech Compression And Synthesis", IEEE (1978), pp. 163-166 ICASSP 78.
Maragos et al., "Speech Nonlinearities, Modulations, and Energy Operators", IEEE (1991), pp. 421-424 ICASSP 91 May 1991.
Quackenbush et al., "The Estimation And Evaluation Of Pointwise Nonlinearities For Improving The Performance Of Objective Speech Quality Measures", IEEE (1983), pp. 547-550 ICASSP, 83.
McCree et al., "A New Mixed Excitation LPC Vocoder", IEEE (1991), p. 593-595 ICASSP 91 May 1991.
McCree et al., "Improving The Performance Of A Mixed Excitation LPC Vocoder In Acoustic Noise", IEEE ICASSP 92 Mar. 1992.
Griffin et al., "Multiband Excitation Vocoder" IEEE Transactions on Acoustics, Speech and Signal processing, vol. 36, No. 8, pp. 1223-1235 (1988).
Almeida et al., "Harmonic Coding: A Low Bit-Rate, Good-Quality Speech Coding Technique," IEEE (CH 1746-7/82/0000 1684) pp. 1664-1667 (1982).
Tribolet et al., "Frequency Domain Coding of Speech," IEEE Transactions on Acoustics, Speech and Signal Processing, V. ASSP-27, No. 5, pp. 512-530 (Oct. 1979).
McAulay et al., "Speech Analysis/Synthesis Based on A Sinusoidal Representaton," IEEE Transactions on Acoustics, Speech and Signal Processing V. 34, No. 4, pp. 744-754, (Aug. 1986).
Griffin, et al. "A New Pitch Detection Algorithm", Digital Signal Processing, No. 84, pp. 395-399.
McAulay, et al., "Computationally Efficient Sine-Wave Synthesis and Its Application to Sinusoidal Transform Coding", IEEE 1988, pp. 370-373.
Portnoff, "Short-Time Fourier Analysis of Sampled Speech", IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. ASSP-29, No. 3, Jun. 1981, pp. 324-333.
Griffin et al. "Signal Estimation from modified Short t-Time Fourier Transform", IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. ASSP-32, No. 2, Apr. 1984, pp. 236-243.
Almeida, et al. "Variable-Frequency Synthesis: An Improved Harmonic Coding Scheme", ICASSP 1984 pp. 27.5.1-27.5.4.
Flanagan, J.L., Speech Analysis Synthesis and Perception, Springer-Verlag, 1982, pp. 378-386.
Secrest, et al., "Postprocessing Techniques for Voice Pitch Trackers", ICASSP, vol. 1, 1982, pp. 171-175.
Patent Abstracts of Japan, vol. 14, No. 498 (P-1124), Oct. 30, 1990.
Mazor et al., "Transform Subbands Coding With Channel Error Control", IEEE 1989, pp. 172-175.
Brandstein et al., "A Real-Time Implementation of the Improved MBE Speech Coder", IEEE 1990, pp. 5-8.
Levesque et al., "A Proposed Federal Standard for Narrowband Digital Land Mobile Radio", IEEE 1990, pp. 497-501.
Yu et al., "Discriminant Analysis and Supervised Vector Quantization for Continuous Speech Recognition", IEEE 1990, pp. 685-688.
Jayant et al., Digital Coding of Waveform, Prentice-Hall, 1984.
Atungsiri et al., "Error Detection and Control for the Parametric Information in CELP Coders", IEEE 1990, pp. 229-232.
Digital Voice Systems, Inc., "Inmarsat-M Voice Coder", Version 1.9, Nov. 18, 1992.
Campbell et al., "The New 4800 bps Voice Coding Standard", Mil Speech Tech Conference, Nov. 1989.
Chen et al., "Real-Time Vector APC Speech Coding at 4800 bps with Adaptive Postfiltering", Proc. ICASSP 1987, pp. 2185-2188.
Jayant et al., "Adaptive Postfiltering of 16 kb/s-ADPCM Speech", Proc. ICASSP 86, Tokyo, Japan, Apr. 13-20, 1986, pp. 829-832.
Makhoul et al., "Vector Quantization in Speech Coding", Proc. IEEE, 1985, pp. 1551-1588.
Rahikka et al., "CELP Coding for Land Mobile Radio Applications," Proc. ICASSP 90, Albuquerque, New Mexico, Apr. 3-6, 1990, pp. 465-468.
Quatieri, et al. "Speech Transformations Based on A Sinusoidal Representation", IEEE, TASSP, vol., ASSP34 No. 6, Dec. 1986, pp. 1449-1464.
Griffin, et al., "A High Quality 9.6 Kbps Speech Coding System", Proc. ICASSP 86, pp. 125-128, Tokyo, Japan, Apr. 13-20, 1986.
Griffin et al., "A New Model-Based Speech Analysis/Synthesis System", Proc. ICASSP 85 pp. 513-516, Tampa. FL., Mar. 26-29, 1985.
Hardwick, "A 4.8 kbps Multi-Band Excitation Speech Coder", S.M. Thesis, M.I.T. May 1988.
McAulay et al., "Mid-Rate Coding Based on a Sinusoidal Representation of Speech", Proc. IEEE 1985 pp. 945-948.
Hardwick et al. "A 4.8 Kbps Multi-band Excitation Speech Coder," Proceedings from ICASSP, International Conference on Acoustics, Speech and Signal Processing, New York, N.Y., Apr. 11-14, pp. 374-377 (1988).

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Synthesis of MBE-based coded speech using regenerated phase info does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Synthesis of MBE-based coded speech using regenerated phase info, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Synthesis of MBE-based coded speech using regenerated phase info will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-1806903

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.