Patent
1995-07-11
1997-09-23
MacDonald, Allen R.
395 216, 395 273, 395 276, G10L 504
active
056713304
ABSTRACT:
A speech synthesis system making use of a pitch-synchronous waveform overlap method to realize stable speech synthesis processing in which pitch shaking is negligible. The present invention is characterized in that glottal closure instants are used as reference points (pitch marks) for overlapping. Since the glottal closure instants can be extracted stably and accurately by using dyadic Wavelet conversion, speech in which pitch shaking is negligible and rumbling sounds are minimized can be synthesized stably. In addition, more flexible waveform separation becomes possible by setting the reference point for overlapping and the reference point for waveform separation to different positions. The extraction of glottal closure instants is performed by searching the local peaks of the dyadic Wavelet conversion, but preferably a threshold value for searching for the local peaks of the dyadic Wavelet conversion is adaptively controlled each time dyadic Wavelet conversion is obtained.
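The peak search described in the abstract can be illustrated with a small sketch. This is not the patent's implementation: it assumes an undecimated Haar-style wavelet as a stand-in for the dyadic wavelet transform (the abstract's "dyadic Wavelet conversion"), and it assumes the threshold is a fixed fraction of the largest response, recomputed each time a transform is taken. The names `dyadic_wavelet`, `glottal_closure_candidates`, `level`, and `ratio` are hypothetical.

```python
def dyadic_wavelet(x, level):
    """Undecimated Haar-style wavelet response of x at scale 2**level.

    Assumption: a Haar wavelet stands in for whatever analysis filter
    the patent actually uses; it responds strongly to abrupt changes.
    """
    s = 2 ** level
    w = [0.0] * len(x)
    for n in range(s, len(x) - s + 1):
        right = sum(x[n:n + s])   # s samples starting at n
        left = sum(x[n - s:n])    # s samples just before n
        w[n] = (right - left) / s
    return w


def glottal_closure_candidates(x, level=2, ratio=0.5):
    """Return indices of local peaks of |wavelet response| above a threshold.

    Assumption: the threshold is ratio * max(|w|), so it adapts to every
    newly computed transform instead of being a global constant.
    """
    w = dyadic_wavelet(x, level)
    mag = [abs(v) for v in w]
    threshold = ratio * max(mag)
    marks = []
    for n in range(1, len(mag) - 1):
        is_peak = mag[n] > mag[n - 1] and mag[n] >= mag[n + 1]
        if is_peak and mag[n] > threshold:
            marks.append(n)
    return marks


# Toy "voiced" frame: a sawtooth with an abrupt jump every 40 samples,
# standing in for the excitation discontinuity at glottal closure.
frame = [1.0 - (n % 40) / 40 for n in range(200)]
marks = glottal_closure_candidates(frame)
```

On the toy frame, the returned marks fall on the sawtooth jumps, one per period. In a pitch-synchronous overlap method, marks like these would serve as the reference points at which windowed waveform segments are aligned and added.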
REFERENCES:
patent: 5054085 (1991-10-01), Meisel et al.
patent: 5175769 (1992-12-01), Hejna, Jr. et al.
patent: 5479564 (1995-12-01), Vogten et al.
patent: 5524172 (1996-06-01), Hamon
patent: 5581652 (1996-12-01), Abe et al.
Stephane Mallat and Sifen Zhong, "Characterization of Signals from Multiscale Edges," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 14, No. 7, pp. 710-732. Jul. 1992.
Gianpaolo Evangelista, "Pitch-Synchronous Wavelet Representation of Speech and Music Signals," IEEE Transactions on Signal Processing, vol. 41, No. 12, pp. 3313-3330. Dec. 1993.
Lunji Qiu, Soo-Ngee Koh, and Hayun Yang, "Pitch Determination of Noisy Speech Using Wavelet Transform in Time and Frequency Domains," Proceedings of IEEE TENCON '93, pp. 337-340. Oct. 1993.
Glenn A. Shelby, Christopher M. Cooper, and Reza R. Adhami, "A Wavelet-Based Speech Pitch Detector for Tone Languages," Proceedings of the IEEE-SP International Symposium on Time-Frequency and Time-Scale Analysis, pp. 596-599. Oct. 1994.
William J. Pielemeier, Gregory H. Wakefield, and Mary H. Simoni, "Time-Frequency Analysis of Musical Signals," Proc. IEEE, vol. 84, No. 9, pp. 1216-1230. Sep. 1996.
Kobayashi Mei
Nishimura Masafumi
Saito Takashi
Sakamoto Masaharu
International Business Machines Corporation
MacDonald Allen R.
Smits Talivadis Ivars
Tassinari, Jr. Robert P.