Methods for generating the voiced portion of speech signals

Patent

Details

G10L 900

Patent

active

051951665

ABSTRACT:
The pitch estimation method is improved. Sub-integer resolution pitch values are estimated in making the initial pitch estimate; the sub-integer pitch values are preferably estimated by interpolating intermediate variables between integer values. Pitch regions are used to reduce the amount of computation required in making the initial pitch estimate. Pitch-dependent resolution is used in making the initial pitch estimate, with higher resolution being used for smaller values of pitch. The accuracy of the voiced/unvoiced decision is improved by making the decision dependent on the energy of the current segment relative to the energy of recent prior segments; if the relative energy is low, the current segment favors an unvoiced decision; if high, it favors a voiced decision. Voiced harmonics are generated using a hybrid approach; some voiced harmonics are generated in the time domain, whereas the remaining harmonics are generated in the frequency domain; this preserves much of the computational savings of the frequency domain approach, while at the same time improving speech quality. Voiced harmonics generated in the frequency domain are generated with higher frequency accuracy; the harmonics are frequency scaled, transformed into the time domain with a Discrete Fourier Transform, interpolated and then time scaled.
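
The hybrid generation of voiced harmonics described in the abstract can be illustrated with a minimal Python sketch. It is not the patented method. It assumes that the lowest harmonics are the ones synthesized in the time domain (the abstract does not say which are chosen), and that every harmonic falls exactly on a DFT bin, so the frequency scaling, interpolation and time scaling steps are omitted. All function and parameter names (synth_time_domain, synth_freq_domain, synth_hybrid, bin0, n_time) are hypothetical.

```python
import cmath
import math


def synth_time_domain(amps, phases, w0, harmonics, n_samples):
    """Sum one cosine oscillator per harmonic, sample by sample."""
    out = [0.0] * n_samples
    for k in harmonics:
        for n in range(n_samples):
            out[n] += amps[k] * math.cos(k * w0 * n + phases[k])
    return out


def inverse_dft_real(spectrum):
    """Naive O(N^2) inverse DFT; a real coder would use an FFT here."""
    n_samples = len(spectrum)
    out = []
    for n in range(n_samples):
        acc = 0j
        for m, x in enumerate(spectrum):
            if x != 0:
                acc += x * cmath.exp(2j * math.pi * m * n / n_samples)
        out.append(acc.real / n_samples)
    return out


def synth_freq_domain(amps, phases, bin0, harmonics, n_samples):
    """Place each harmonic in its DFT bin, then apply one inverse DFT.

    Assumption: harmonic k lies exactly on bin k * bin0, below Nyquist.
    """
    spectrum = [0j] * n_samples
    for k in harmonics:
        b = k * bin0
        if not 0 < b < n_samples / 2:
            raise ValueError("harmonic must lie strictly below Nyquist")
        coeff = 0.5 * n_samples * amps[k] * cmath.exp(1j * phases[k])
        spectrum[b] += coeff
        spectrum[n_samples - b] += coeff.conjugate()  # keep the output real
    return inverse_dft_real(spectrum)


def synth_hybrid(amps, phases, bin0, n_samples, n_time):
    """Harmonics 1..n_time in the time domain, the rest via the DFT."""
    w0 = 2.0 * math.pi * bin0 / n_samples
    low = [k for k in sorted(amps) if k <= n_time]
    high = [k for k in sorted(amps) if k > n_time]
    t = synth_time_domain(amps, phases, w0, low, n_samples)
    f = synth_freq_domain(amps, phases, bin0, high, n_samples)
    return [a + b for a, b in zip(t, f)]


# Seven harmonics of a fundamental on bin 4 of a 64-point frame.
amps = {k: 1.0 / k for k in range(1, 8)}
phases = {k: 0.3 * k for k in range(1, 8)}
frame = synth_hybrid(amps, phases, bin0=4, n_samples=64, n_time=3)
```

Under these assumptions the hybrid frame matches an all-oscillator synthesis of the same harmonics, while the per-sample oscillator cost is paid only for the first n_time harmonics. Harmonics that fall between bins are the case the abstract's frequency scaling and interpolation steps are meant to handle, and this sketch does not cover it.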

REFERENCES:
patent: 3982070 (1976-09-01), Flanagan
patent: 3995116 (1976-11-01), Flanagan
patent: 4076958 (1978-02-01), Fulghum
patent: 4797926 (1989-01-01), Bronson et al.
patent: 4829574 (1989-05-01), Dewhurst et al.
patent: 4856068 (1989-08-01), Quatieri et al.
Griffin, et al., "A New Pitch Detection Algorithm", Digital Signal Processing, No. 84, pp. 395-399, 1984, Elsevier Science Publishers.
Griffin, et al., "A New Model-Based Speech Analysis/Synthesis System", IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1985, pp. 513-516.
McAulay, et al., "Mid-Rate Coding Based on a Sinusoidal Representation of Speech", IEEE 1985, pp. 945-948.
McAulay, et al., "Computationally Efficient Sine-Wave Synthesis and Its Application to Sinusoidal Transform Coding", IEEE 1988, pp. 370-373.
Hardwick, "A 4.8 Kbps Multi-Band Excitation Speech Coder", Thesis for Degree of Master of Science in Electrical Engineering and Computer Science, Massachusetts Institute of Technology, May 1988, pp. 1-68.
Griffin, "Multi-Band Excitation Vocoder", Thesis for Degree of Doctor of Philosophy, Massachusetts Institute of Technology, Feb. 1987, pp. 1-131.
Portnoff, "Short-Time Fourier Analysis of Sampled Speech", IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. ASSP-29, No. 3, Jun. 1981, pp. 324-333.
Griffin, et al., "Signal Estimation from Modified Short-Time Fourier Transform", IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. ASSP-32, No. 2, Apr. 1984, pp. 236-243.
Almeida, et al., "Harmonic Coding: A Low Bit-Rate, Good-Quality Speech Coding Technique", IEEE (1982) CH1746/7/82, pp. 1664-1667.
Quatieri, et al., "Speech Transformations Based on a Sinusoidal Representation", IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. ASSP-34, No. 6, Dec. 1986, pp. 1449-1464.
Griffin, et al., "Multiband Excitation Vocoder", IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 36, No. 8, Aug. 1988, pp. 1223-1235.
Almeida, et al., "Variable-Frequency Synthesis: An Improved Harmonic Coding Scheme", ICASSP 1984, pp. 27.5.1-27.5.4.
Flanagan, J. L., Speech Analysis Synthesis and Perception, Springer-Verlag, 1982, pp. 378-386.
