Text-to-speech using clustered context-dependent phoneme-based u

Data processing: speech signal processing – linguistics – language – Speech signal processing – Synthesis

Patent

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

704243, 704244, 704245, 704255, 704256, 704257, 704258, 704260, 704266, 704267, 704268, 704269, G10L 1300

Patent

active

061637696

ABSTRACT:
A text-to-speech system includes a storage device for storing a clustered set of context-dependent phoneme-based units of a target speaker. In one embodiment, decision trees are used wherein each decision tree based context-dependent phoneme-based unit is arranged based on context of at least one immediately preceding and succeeding phoneme. At least one of the context-dependent phoneme-based units represents other non-stored context-dependent phoneme units of similar sound due to similar contexts. A text analyzer obtains a string of phonetic symbols representative of text to be converted to speech. A concatenation module selects stored decision tree based context-dependent phoneme-based units from the set decision tree based context-dependent phoneme-based units based on the context of the phonetic symbols and synthesizes the selected phoneme-based units to generate speech corresponding to the text.

REFERENCES:
patent: 4852173 (1989-07-01), Bahl et al.
patent: 4979216 (1990-12-01), Malsheen et al.
patent: 5153913 (1992-10-01), Kandefer et al.
patent: 5384893 (1995-01-01), Hutchins
patent: 5636325 (1997-06-01), Farrett
patent: 5794197 (1998-08-01), Alleva et al.
Nakajima, S., Hamada, H., "Automatic Generation of Synthesis Units Based on Context Oriented Clustering", IEEE International Conference on Acoustics, Speech, and Signal Processing, New York, Apr. 1988, pp. 659-662.
Ney, H., Heab-Umbach, R., Tran, B.H., Oerder, M., "Improvements in Beam Search for 10000-Word Continuous Speech Recognition", IEEE International Conference on Acoustics, Speech, and Signal Processing, California, Mar. 1992, pp. I-9--I-12.
Emerard, F., Mortamet, L., Cozannet, A., "Prosodic processing in a text-to-speech synthesis system using a database and learning procedures", Talking Machines: Theories, Models, and Designs, 1992, pp. 225-254.
Riley, M., "Tree-based modelling of segmental durations", Talking Machines: Theories, Models, and Designs, 1992, pp. 265-273.
Hwang, M.Y., Huang X., Alleva, F., "Predicting Unseen Triphone with Senones", IEEE International Conference on Acoustics, Speech, and Signal Processing, Minnesota, Apr., 1993, pp. II-311--II-314.
Donovan, R.E., Woodland, P.C., "Improvements in an HMM-Based Speech Synthesiser", Proceedings of European Conference on Speech Communication and Technology, Madrid, Spain, Sep. 1995, pp. 573-576.
Huang, X., Acero, A., Alleva F., Hwang, M.Y., Jiang, L., Mahajan, M., "Microsoft Windows Highly Intelligent Speech Recognizer: Whisper", IEEE International Conference on Acoustics, Speech, and Signal Processing, Detroit, 1995, pp. 1-5.
Alleva, F., Xuedong, H., Hwang, M.Y., "Improvements on the Pronunciation Prefix Tree Search Organization", IEEE International Conference on Acoustics, Speech, and Signal Processing, Georgia, May 1996, pp. 133-136.
Hsiao-Wuen et al., "CMU Robust Vocabulatory-Independent Speech Recognition System", IEEE International Conference on Acoustics, Speech and Signal Processing, Toronto, Canada, 1991, pp. 889-892.
Young et al., "Tree-Based State Tying for High-Accuracy Acoustic Modelling" ARPA Workshop on Human Language Technology, Merrill Lynch Conference Centre, pp 307-312, 1994.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Text-to-speech using clustered context-dependent phoneme-based u does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Text-to-speech using clustered context-dependent phoneme-based u, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Text-to-speech using clustered context-dependent phoneme-based u will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-277625

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.