Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission
Patent
1996-04-12
1999-12-28
Knepper, David D.
Data processing: speech signal processing, linguistics, language
Speech signal processing
For storage or transmission
704265, 704203, G10L 500, G10L 706, G10L 300
Patent
active
RE0364789
ABSTRACT:
A sinusoidal model for acoustic waveforms is applied to develop a new analysis/synthesis technique which characterizes a waveform by the amplitudes, frequencies, and phases of component sine waves. These parameters are estimated from a short-time Fourier transform. Rapid changes in the highly-resolved spectral components are tracked using the concept of "birth" and "death" of the underlying sine waves. The component values are interpolated from one frame to the next to yield a representation that is applied to a sine wave generator. The resulting synthetic waveform preserves the general waveform shape and is perceptually indistinguishable from the original. Furthermore, in the presence of noise the perceptual characteristics of the waveform as well as the noise are maintained. The method and devices are particularly useful in speech coding, time-scale modification, frequency scale modification and pitch modification.
REFERENCES:
patent: 3296374 (1967-01-01), Clapper
patent: 3360610 (1967-12-01), Flanagan
patent: 3484556 (1969-12-01), Flanagan et al.
patent: 3978287 (1976-08-01), Fletcher et al.
patent: 3982070 (1976-09-01), Flanagan
patent: 4034160 (1977-07-01), Van Gerwen
patent: 4058676 (1977-11-01), Wilkes et al.
patent: 4076958 (1978-02-01), Fulghum
patent: 4435832 (1984-03-01), Asada et al.
patent: 4701955 (1987-10-01), Taguchi
Malpass, "The Gold-Rabiner Pitch Detector In A Real Time Environment," Proc. of Eascon (Sep. 1975), pp. 1-7.
Gold, "Description of a Computer Program for Pitch Detection," Fourth International Congress, Copenhagen, Aug. 21-18, 1962.
Gold, "Note On Buzz-Hiss Detection," J. Acoust. Soc. Am, vol. 36, No. 9, 1964, pp. 1659-1661.
Holmes, "The JSRU Channel Vocoder," IEE Proc., vol. 127, No. 1, 1980, pp. 53-60.
Rabiner & Schafer, Digital Processing of Signals, Prentice Hall, 1978, pp. 225-238.
Markell, Linear Prediction of Speech, Springer-Verlog, 1967, pp. 227-262.
Almeida et al., "Variable-Frequency Synthesis: An Improved Harmonic Coding Scheme," IEEE, vol. 2, 1984, pp. 27.5.1-27.5.4.
Crochiere, "A Weighted Overlap-Add Method of Short-time Fourier Analysis/Synthesis," IEEE Trans. on Acoustics, Speech & Sig. Proc., vol. ASSP-28, 1980, pp. 99-102.
Silverman et al., "Transfer Characteristic Estimation for Speech Via Multirate Evaluation," IEEE, pub. 75 CHO 998-5 Eascon, 1975, pp. 181-A to 181-G (7 pages).
"A Tone-Oriented Voice-Excited Vocoder," Hedelin; Chalmers University of Technology, Gothenburg, Sweden, CH1610/5/81, pp. 205-208, IEEE, 1981.
"A Representation of Speech With Partials," Hedelin; 1982 Elmevier Biological Press, The Representation of Speech in the Peripheral Auditory System, R. Carlson & B. Granstrom, pp. 247-250.
Almeida, Luis B. et al., "Harmonic Coding: A Low Bit-Rate, Good Quality Speech Coding Technique," IEEE, 1982, pp. 1664-1667.
Griffin, Daniel W. et al., "A New Model-Based Speech Analysis/Synthesis System," IEEE International Conference on Acoustics, Speech, and Signal Processing, Mar. 26-29, 1985, pp. 513-516.
Griffin, Daniel W. et al., "A New Pitch Detection Algorithm," Proc. of Int. Conf. on Digital Signal Processing, Florence, Italy, Sep. 1984, pp. 395-399.
McAulay Robert J.
Quatieri, Jr. Thomas F.
Knepper David D.
Massachusetts Institute of Technology
LandOfFree
Processing of acoustic waveforms does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Processing of acoustic waveforms, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Processing of acoustic waveforms will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2373750