Data processing: speech signal processing – linguistics – language – Speech signal processing – Application
Patent
1996-03-15
1998-05-05
MacDonald, Allen R.
Data processing: speech signal processing, linguistics, language
Speech signal processing
Application
704203, 704206, 704209, 704241, 704265, 704270, G10L 302
Patent
active
057490732
ABSTRACT:
In the first step of a sound morphing process, each sound which forms the basis for the morph is converted into one or more quantitative representations, such as spectrograms. After the representations have been obtained, the temporal axes of the two sounds are matched, so that similar components of the two sounds, such as onsets, harmonic regions and inharmonic regions, are aligned with one another. Other characteristics of the sounds, such as pitch, formant frequencies, or the like, are then matched. Once the energy in each of the sounds has been accounted for and matched to that of the other sound, the two sounds are cross-faded, to produce a representation of a new sound. This representation is then inverted, to generate the morphed sound.
REFERENCES:
patent: 4706537 (1987-11-01), Oguri
patent: 5097326 (1992-03-01), Meijer
patent: 5291557 (1994-03-01), Davis et al.
patent: 5327521 (1994-07-01), Savic et al.
patent: 5371315 (1994-12-01), Hanzawa et al.
patent: 5473759 (1995-12-01), Slaney et al.
patent: 5583961 (1996-12-01), Pawlewski et al.
patent: 5625749 (1997-04-01), Goldenthal et al.
Davis, Stephen B., et al, "Comparison of parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences", IEEE Transactions of Acoustics, Speech, and Signal Processing, vol. ASSP-28, No. 4, 4, Aug. 1980.
Van Immerseel, Luc M., et al, "Pitch and voiced/unvoiced determination with an auditory model", J. Acoust. Soc. Am. 91 (6), Jun. 1992, 1992 Acoustical Society of America, pp. 3511-3526.
White, George M., et al, "Speech Recognition Experiments with Linear Prediction, Bandpass Filtering, and Dynamic Programming", IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. ASSP-24, No. 2, Apr. 1976, pp. 183-188.
Yong, Mei, "A New LPC Interpolation Technique for CELP Coders", IEEE Transactions on Communications, vol. 42, No. 1, Jan. 1994, pp. 34-38.
"Morpheus Z-Plan Synthesizer", E-mu Systems, Inc.
Oberheim Digital Presents a Technology Dossier On Fourier analysis Resynthesis, 1994, pp. 1-16.
World Wide Web Home Page for Voxware, Inc., describing the Morph-Kit voice utility.
Announcement for Sound Morph program for Macintosh.
Amini, Amir A., et al, "Using Dynamic Programming for Solving Variational Problems in Vision", IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 12, No. 9, Sep. 1990, pp. 855-867.
Beier, Thaddeus, et al, "Feature-Based Image Metamorphosis", SIGGRAPH '92, Chicago, Jul. 26-31, 1992, p. 35-42
Blinn, James F., "What's the Deal with DCT?", IEEE Computer Graphics & Applications, Jul. 1993, pp. 78-83.
Bruderlin, Armin, et al, "Motion Signal Processing", Computer Graphics & Proceedings, Annual Conference Series, 1995, pp. 97-104.
Covell, Michele, et al, "Spanning the Gap Between Motion Estimation and Morphing", Interval Research Corporation, 1994, pp. V-213-V-216.
Deller et al, "Dynamic Time Wraping", Discrete-time Processing of Speech Signals, New York, Macmillan Pub. Co., 1993, pp. 623-676.
Depalle, Philippe, et al, "Tracking of Partials for Additive Sound Synthesis Using Hidden Markov Models", IRCAM, pp. I-225-I-228.
Griffin, Daniel W., et al, "Signal Estimation from Modified Short-Time Fourier Transform", IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. ASSP-32, No. 2,Apr. 1984, pp. 236-243.
Hunt, M, J., et al, "Experiments in Syllable-Based Recognition of Continuous Speech", Bell-Northern Research, Apr. 1980, pp. 880-883.
Savic, Michael et al, "Voice Personality Transformation", Digital Signal Processing 1, 107-110 (1991).
Secrest, Bruce, et al, "An Integrated Pitch Tracking Algorithm for Speech Systems", Texax Instruments, Inc., ICASSP 83, Boston, pp. 1352-1355.
Tellman, Edwin, et al, "Timbre Morphing of Sounds with Unequal Numbers of Features", CERL Sound Group, University of Illinois, rev. May 1, 1995, pp. 1-12.
Valbret, H., et al, "Voice transformation using PSOLA tehnique", Speech Communication, vol. 11, Nos. 2-3, Jun. 1992, pp. 175-187
Collins Alphonso A.
Interval Research Corporation
MacDonald Allen R.
LandOfFree
System for automatically morphing audio information does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with System for automatically morphing audio information, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and System for automatically morphing audio information will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-71559