1991-08-22
1994-07-05
Fleming, Michael R.
395 27, G10L 902
Patent
active
053275187
ABSTRACT:
A method and apparatus for the automatic analysis, synthesis and modification of audio signals, based on an overlap-add sinusoidal model, is disclosed. Automatic analysis of amplitude, frequency and phase parameters of the model is achieved using an analysis-by-synthesis procedure which incorporates successive approximation, yielding synthetic waveforms which are very good approximations to the original waveforms and are perceptually identical to the original sounds. A generalized overlap-add sinusoidal model is introduced which can modify audio signals without objectionable artifacts. In addition, a new approach to pitch-scale modification allows for the use of arbitrary spectral envelope estimates and addresses the problems of high-frequency loss and noise amplification encountered with prior art methods. The overlap-add synthesis method provides the ability to synthesize sounds with computational efficiency rivaling that of synthesis using the discrete short-time Fourier transform (DSTFT) while eliminating the modification artifacts associated with that method.
REFERENCES:
patent: 4856068 (1989-08-01), Quatieri, Jr. et al.
patent: 4885790 (1989-12-01), McAulay et al.
patent: 4937873 (1990-06-01), McAulay et al.
patent: 5054072 (1991-10-01), McAulay et al.
Robert J. McAulay and Thomas F. Quatieri, "Pitch Estimation and Voicing Detection Based on a Sinusoidal Speech Model," IEEE Int'l Conf. on Acoustics, Speech and Signal Processing, pp. 249-252 (Apr. 1990).
Thomas F. Quatieri and Robert J. McAulay, "Phase Coherence in Speech Reconstruction for Enhancement and Coding Applications," IEEE Int'l Conf. on Acoustics, Speech and Signal Processing, Glasgow, Scotland, pp. 207-209 (May 1989).
Robert J. McAulay and Thomas F. Quatieri, "Computationally Efficient Sine-Wave Synthesis and Its Application to Sinusoidal Transform Coding," IEEE Int'l Conf. on Acoustics, Speech and Signal Processing, pp. 370-373 (Apr. 1988).
Thomas F. Quatieri and Robert J. McAulay, "Mixed-Phase Deconvolution of Speech Based on a Sine-Wave Model," IEEE Int'l Conf. on Acoustics, Speech and Signal Processing, pp. 649-652 (Apr. 1987).
Thomas F. Quatieri and Robert J. McAulay, "Speech Transformations Based on a Sinusoidal Representation," IEEE Transactions on Acoustics, Speech and Signal Processing, pp. 1449-1464, vol. ASSP-34, No. 6 (Dec. 1986).
Robert J. McAulay and Thomas F. Quatieri, "Speech Analysis/Synthesis Based on a Sinusoidal Representation," IEEE Transactions on Acoustics, Speech and Signal Processing, pp. 744-754, vol. ASSP-34, No. 4 (Aug. 1986).
R. J. McAulay and T. F. Quatieri, "Phase Modeling and Its Application to Sinusoidal," IEEE Int'l Conf. on Acoustics, Speech and Signal Processing, Tokyo, Japan, pp. 1713-1715 (Apr. 1986).
George E. Bryan
Smith Mark J. T.
Doerrler Michelle
Fleming Michael R.
Georgia Tech Research Corporation
LandOfFree
Audio analysis/synthesis system does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Audio analysis/synthesis system, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Audio analysis/synthesis system will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-802747