Patent
1997-10-02
2000-12-19
Knepper, David D.
Data processing: speech signal processing, linguistics, language
Speech signal processing
Synthesis
704243, 704244, 704245, 704255, 704256, 704257, 704258, 704260, 704266, 704267, 704268, 704269, G10L 1300
active
061637696
ABSTRACT:
A text-to-speech system includes a storage device for storing a clustered set of context-dependent phoneme-based units of a target speaker. In one embodiment, decision trees are used, wherein each decision-tree-based context-dependent phoneme-based unit is arranged based on the context of at least one immediately preceding and succeeding phoneme. At least one of the context-dependent phoneme-based units represents other non-stored context-dependent phoneme units of similar sound due to similar contexts. A text analyzer obtains a string of phonetic symbols representative of text to be converted to speech. A concatenation module selects stored decision-tree-based context-dependent phoneme-based units from the set of decision-tree-based context-dependent phoneme-based units based on the context of the phonetic symbols and synthesizes the selected phoneme-based units to generate speech corresponding to the text.
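The selection step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the phoneme classes, tree questions, unit names, and toy sample values below are all hypothetical and chosen only to show how one stored unit can stand in for several contexts that were never stored individually.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Union

# Hypothetical phoneme classes used as decision-tree questions.
NASALS = {"m", "n", "ng"}
VOWELS = {"aa", "ae", "ih", "iy", "uw"}
SIL = "sil"  # assumed silence symbol padding the start and end of an utterance


@dataclass
class Leaf:
    unit_id: str  # key of the stored unit shared by every context reaching this leaf


@dataclass
class Node:
    question: Callable[[str, str], bool]  # asks about the (left, right) neighbours
    yes: Union["Node", Leaf]
    no: Union["Node", Leaf]


def lookup(tree: Union[Node, Leaf], left: str, right: str) -> str:
    """Walk one phoneme's tree until a leaf (a clustered unit) is reached."""
    while isinstance(tree, Node):
        tree = tree.yes if tree.question(left, right) else tree.no
    return tree.unit_id


# One hypothetical tree per phoneme; real trees would be learned from speech data.
TREES: Dict[str, Union[Node, Leaf]] = {
    "ae": Node(lambda l, r: r in NASALS,
               Leaf("ae_before_nasal"),
               Node(lambda l, r: l in VOWELS,
                    Leaf("ae_after_vowel"),
                    Leaf("ae_default"))),
    "k": Leaf("k_default"),
    "t": Leaf("t_default"),
    "n": Leaf("n_default"),
}

# Stored units as toy sample lists; a real system would hold recorded speech segments.
UNITS: Dict[str, List[float]] = {
    "ae_before_nasal": [0.1, 0.2],
    "ae_after_vowel": [0.3],
    "ae_default": [0.4, 0.5],
    "k_default": [0.6],
    "t_default": [0.7],
    "n_default": [0.8],
}


def select_units(phonemes: List[str]) -> List[str]:
    """Pick one stored unit per phoneme from its immediate left/right context."""
    padded = [SIL, *phonemes, SIL]
    return [lookup(TREES[padded[i]], padded[i - 1], padded[i + 1])
            for i in range(1, len(padded) - 1)]


def synthesize(phonemes: List[str]) -> List[float]:
    """Concatenate the selected units' samples in order (no smoothing at the joins)."""
    samples: List[float] = []
    for unit_id in select_units(phonemes):
        samples.extend(UNITS[unit_id])
    return samples
```

In this sketch the contexts k-ae+n and k-ae+m both reach the leaf `ae_before_nasal`, so a single stored unit represents both, which mirrors the abstract's statement that one unit can represent non-stored units of similar sound. Plain concatenation is used for brevity; prosody and boundary smoothing are left out.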
REFERENCES:
patent: 4852173 (1989-07-01), Bahl et al.
patent: 4979216 (1990-12-01), Malsheen et al.
patent: 5153913 (1992-10-01), Kandefer et al.
patent: 5384893 (1995-01-01), Hutchins
patent: 5636325 (1997-06-01), Farrett
patent: 5794197 (1998-08-01), Alleva et al.
Nakajima, S., Hamada, H., "Automatic Generation of Synthesis Units Based on Context Oriented Clustering", IEEE International Conference on Acoustics, Speech, and Signal Processing, New York, Apr. 1988, pp. 659-662.
Ney, H., Haeb-Umbach, R., Tran, B.H., Oerder, M., "Improvements in Beam Search for 10000-Word Continuous Speech Recognition", IEEE International Conference on Acoustics, Speech, and Signal Processing, California, Mar. 1992, pp. I-9--I-12.
Emerard, F., Mortamet, L., Cozannet, A., "Prosodic processing in a text-to-speech synthesis system using a database and learning procedures", Talking Machines: Theories, Models, and Designs, 1992, pp. 225-254.
Riley, M., "Tree-based modelling of segmental durations", Talking Machines: Theories, Models, and Designs, 1992, pp. 265-273.
Hwang, M.Y., Huang, X., Alleva, F., "Predicting Unseen Triphone with Senones", IEEE International Conference on Acoustics, Speech, and Signal Processing, Minnesota, Apr. 1993, pp. II-311--II-314.
Donovan, R.E., Woodland, P.C., "Improvements in an HMM-Based Speech Synthesiser", Proceedings of European Conference on Speech Communication and Technology, Madrid, Spain, Sep. 1995, pp. 573-576.
Huang, X., Acero, A., Alleva, F., Hwang, M.Y., Jiang, L., Mahajan, M., "Microsoft Windows Highly Intelligent Speech Recognizer: Whisper", IEEE International Conference on Acoustics, Speech, and Signal Processing, Detroit, 1995, pp. 1-5.
Alleva, F., Huang, X., Hwang, M.Y., "Improvements on the Pronunciation Prefix Tree Search Organization", IEEE International Conference on Acoustics, Speech, and Signal Processing, Georgia, May 1996, pp. 133-136.
Hon, H.W., et al., "CMU Robust Vocabulary-Independent Speech Recognition System", IEEE International Conference on Acoustics, Speech, and Signal Processing, Toronto, Canada, 1991, pp. 889-892.
Young et al., "Tree-Based State Tying for High-Accuracy Acoustic Modelling", ARPA Workshop on Human Language Technology, Merrill Lynch Conference Centre, 1994, pp. 307-312.
Acero Alejandro
Hon Hsiao-Wuen
Huang Xuedong D.
Knepper David D.
Koehler S.
Microsoft Corporation
Text-to-speech using clustered context-dependent phoneme-based u
Profile ID: LFUS-PAI-O-277625