1993-08-31
1994-07-05
Knepper, David D.
395 212, 395 2, G10L 300
Patent
active
053275217
ABSTRACT:
A high quality voice transformation system and method operates during a training mode to store voice signal characteristics representing target and source voices. Thereafter, during a real time transformation mode, a signal representing source speech is segmented into overlapping segments, analyzed to separate the excitation spectrum from the tone quality spectrum. A stored target tone quality spectrum is substituted for the source spectrum and then convolved with the actual source speech excitation spectrum to produce a transformed speech signal having the word and excitation content of the source, but the acoustical characteristics of a target speaker. The system may be used to enable a talking, costumed character, or in other applications where a source speaker wishes to imitate the voice characteristics of a different, target speaker.
REFERENCES:
patent: 4058676 (1987-11-01), Wilkes et al.
patent: 4400591 (1983-08-01), Jennings et al.
patent: 4667340 (1987-05-01), Arjmand et al.
patent: 4683588 (1987-07-01), Goldberg
patent: 4815135 (1989-03-01), Taguchi
patent: 4827516 (1989-05-01), Tsukahara et al.
patent: 4856068 (1989-08-01), Quatieri et al.
patent: 4864626 (1989-09-01), Yang
patent: 4885790 (1989-12-01), McAulay et al.
patent: 4937873 (1990-06-01), McAulay et al.
patent: 5029211 (1991-07-01), Ozawa
patent: 5113449 (1992-05-01), Blanton et al.
ICASSP'91 (1991 International Conference on Acoustics, Speech and Signal Processing, Toronto, Ontario, 14-17 May 1991), vol. 2, IEEE, (New York, US), M. ABE: "A segment-based approach to voice conversion", pp. 765-768, see p. 765, right-hand column, lines 2-28.
ICASSP'88 (1988) International Conference on Acoustics, Speech, and Signal Processing, New York, 11-14 Apr. 1988), vol. 1, IEEE, (New York, US), V. Goncharoff et al.: "Adaptive speech modification by spectral warping", pp. 343-346, see paragraph 2: Spectral envelope modification, figure 1.
Systems and Computers in Japan, vol. 21, No. 10, 1990 (New York, US), M. Abe et al.: "A speech modification method by signal reconstruction using short-tern Fourier transform", pp. 26-33, see figure 1.
IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. ASSP-28, No. 1, Feb. 1980, (New York, US), R. E. Crochiere: "A weighted overlap-add method of short-time Fourier analysis/synthesis", pp. 99-102, see abstract: figure 2.
Onzieme Colloque sur le Traitement du Signal et des Images (Nice, 1-5 Jun. 1987), Gretsi, (Paris, FR), J. Crestel et al.: "Un systeme pour l'amelioration des communications en plongee profonde", pp. 435-438, see figure 2.
A. Oppenheim and R. Schafer, Digital Signal Processing, Prentice-Hall, (1975), pp. 284-327.
L. Rabiner and R. Schafer, Digital processing of speech Signals, Prentice-Hall, (1978), pp. 303-306.
L. Rabiner and R. Schafer, Digital Processing of Speech Signals, Prentice-Hall, (1978), pp. 411-413.
S. Roucos and A. Wilgus, "High Quality Time-Scale Modification for Speech," IEEE International Conference on Acoustic, Speech and Signal Processing, CH2118-8/85/0000-0493, pp. 493-496, (Mar. 26-29, 1985).
M. Abe, S. Nakamura, K. Shikano, and H. Kuwabara, "Voice Conversion Through Vector Quantization", IEEE International Conference on Acoustics, Speech and Signal Processing, (Apr. 1988), pp. 655-658.
M. Abe, S. Tamura and H. Kuwabara, "A New Speech Modification Method by Signal Reconstruction", IEEE International Conference on Acoustic, Speech, and Signal Processing, (Apr. 1989), pp. 592-595.
L. Almeida and F. Silva, "Variable-Frequency Synthesis: An Improved Harmonic Coding Scheme", Proceedings of the IEEE International Conference on Acoustic, Speech, and Signal Processing, (Mar. 1984), pp. 27.5.1-27.5.4.
H. Bonneau and J. Gauvain, "Vector Quantization for Speaker Adaption", Proceedings of the IEEE International Conference on Acoustic, Speech and Signal Processing, (Apr. 1987), pp. 1434-1437.
D. Childers, "Talking Computers: Replacing Mel Blanc", Computers in Mechanical Engineering, vol. 6, No. 2 (Sep./Oct. 1987), pp. 22-31.
D. Childers, K. Wu, D. Hicks, and B. Yegnanarayana, "Voice Conversion", Speech Communication 8, (1989), pp. 147-158.
D. Childers, B. Yegnanarayana, and K. Wu, "Voice Conversion: Factors Responsible for Quality", Proceedings of the IEEE International Conference on Acoustic, Speech, and Signal Processing, (Mar. 1985) pp. 748-751.
D. Griffin and J. Lim, "Signal Estimation from Modified Short-Time Fourier Transform", IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. ASSP-32, No. 2, (Apr. 1984), pp. 236-243.
J. Jaschul, "An Approach to Speaker Normalization for Automatic Speech Recogniation", Proceedings of the IEEE International Conference on Acoustic, Speech, and Signal Processing, (Apr. 1979) pp. 235-238.
M. Portnoff, "Time-Scale Modification of Speech Based on Short-Time Fourier Analysis", IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. ASSP-29, No. 3, (Jun. 1981), pp. 374-390.
T. Quatieri and R. McAulay, "Apeech Transformations Based on a Sinusoidal Representation", IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. ASSP-34, No. 6, (Dec. 1986), pp. 1449-1461.
M. Ross, H. Shaffer, A. Cohen, F. Freudberg and H. Manley, "Average Magnitude Difference Function Pitch Extractor", IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. ASSP-30, No. 5, (Oct. 1974), pp. 353-362.
S. Seneff, "System to Independently Modify Excitation and/or Spectrum of Speech Waveform Without Explicit Pitch Extraction", IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. ASSP-30 No. 4, (Aug. 1982), pp. 566-578.
S. Seneff, "Speech Transformation System (Spectrum and/or Excitation) Without Pitch Extraction", Massachusette Institute of Technology, Lincoln Laboratory, Technical Report 541, (Jul. 1980).
L. Rabiner, M. Cheng, A. Rosenberg, and C. McGonegal, "A Comparative Performance Study of Several Pitch Detection Algorithms", IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 24, No. 5, (Oct. 1976), pp. 399-404.
J. Markel and A. Gray, Jr., linear prediction of Speech, Springer-Verlag, (1982).
Nam Il-Hyun
Savic Michael I.
Tan Seow-Hwee
Knepper David D.
The Walt Disney Company
LandOfFree
Speech transformation system does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Speech transformation system, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Speech transformation system will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-802819