Data processing: speech signal processing – linguistics – language – Speech signal processing – Synthesis
Patent
1997-01-29
1998-03-24
Hafiz, Tariq R.
Data processing: speech signal processing, linguistics, language
Speech signal processing
Synthesis
704258, 704266, 704267, G10L 502
Patent
active
057323950
ABSTRACT:
Improved automated synthesis of human audible speech from text is disclosed. Performance enhancement of the underlying text comprehensibility is obtained through prosodic treatment of the synthesized material, improved speaking rate treatment, and improved methods of spelling words or terms for the system user. Prosodic shaping of text sequences appropriate for the discourse in large groupings of text segments, with prosodic boundaries developed to indicate conceptual units within the text groupings, is implemented in a preferred embodiment.
REFERENCES:
patent: 3704345 (1972-11-01), Coker et al.
patent: 4470150 (1984-09-01), Ostrowski
patent: 4685135 (1987-08-01), Lin et al.
patent: 4689817 (1987-08-01), Kroon
patent: 4692941 (1987-09-01), Jacks et al.
patent: 4695962 (1987-09-01), Goudie
patent: 4783810 (1988-11-01), Kroon
patent: 4783811 (1988-11-01), Fisher et al.
patent: 4829580 (1989-05-01), Church
patent: 4831654 (1989-05-01), Dick
patent: 4884972 (1989-12-01), Gasper
patent: 4896359 (1990-01-01), Yamamoto et al.
patent: 4907279 (1990-03-01), Higuchi et al.
patent: 4908867 (1990-03-01), Silverman
patent: 4964167 (1990-10-01), Kunizawa et al.
patent: 4979216 (1990-12-01), Maisheen et al.
patent: 5040218 (1991-08-01), Vitale et al.
patent: 5212731 (1993-05-01), Zimmermann
patent: 5384893 (1995-01-01), Hutchins
Julia Hirschberg and Janet Pierrehumbert, "The Intonational Structuring of Discourse", Association of Computational Linguistics: 1986 (ACL-86) pp. 1-9.
J.S. Young, F. Fallside, "Synthesis by Rule of Prosodic Features in Word Concatenation Synthesis", Int. Journal Man-Machine Studies, (1980) V12, pp. 241-258.
A.W.F. Huggins, "speech Timing and Intelligibility", Attention and Performance VII, Hillsdale, NJ: Erlbaum 1978, pp. 279-297.
S.J. Young and F. Fallside, "Speech Synthesis from Concept: A Method for Speech Output From Information Systems", J. Acoust. Soc. Am. 66 (3), Sep. 1979, pp. 685-695.
B.G. Green, J.S. Logan, D.B. Pisoni, "Perception of Synthetic Speech Produced Automatically by Rule: Intelligibility of Eight Text-to-Speech Systems", Behavior Research Methods, Instruments & Computers, V18, 1986, pp. 100-107.
B.G. Greene, L.M. Manous, D.B. Pisoni, "Perceptual Evaluation of DECtalk: A Final Report on Version 1.8*", Research on Speech Perception Progress Report No. 10, Bloomington, IN. Speech Research Laboratory, Indiana University (1984), pp. 77-127.
Kim E.A. Silverman, Doctoral Thesis, "The Structure and Processing of Fundamental Frequency Contours", University of Cambridge (UK) 1987.
J.C. Thomas and M.B. Rosson, "Human Factors and Synthetic Speech", Human Computer Interaction--INTERACT '84, North Holland Elsevier Science Publishers (1984) pp. 219-224.
Y. Sagisaka, "Speech Synthesis From Text", IEEE Communications Magazine, vol. 28, iss 1, Jan. 1990, pp. 35-41.
E. Fitzpatrick and J. Bachenko, "Parsing for Prosody: What a Text-to-Speech System Needs from Syntax", pp. 188-194, 27-31 Mar. 1989.
Moulines et al., "A Real-Time French Text-To-Speech System Generating High-Quality Synthetic Speech", ICASSP 90, pp. 309-312, vol. 1, 3-6 Apr. 1990.
Wilemse et al, "Context Free Card Parsing In A Text-To-Speech System", ICASSP 91, pp. 757-760, vol. 2, 14-17 May, 1991.
James Raymond Davis and Julia Hirschberg, "Assigning Intonational Features in Synthesized Spoken Directons", 26th Annual Meeting of Assoc. Computational Lingustisics; 1988, pp. 1-9.
K. Silverman, S. Basson, S. Levas, "Evaluating Synthesizer Performance: Is Segmental Intelligibility Enough", International Conf. on spoken Language Processing, 1990.
J. Allen, M.S. Hunnicutt, D. Klatt, "From Text to Speech: The MIT Talk System", Cambridge University Press, 1987.
T. Boogaart, K. Silverman, "Evaluating the Overall Comprehensibility of speech Synthesizers", Proc, Int'l Conference on Spoken Language Processing, 1990.
K. Silverman, S. Basson, S. Levas, "On Evaluating Synthetic Speech: What Load Does It Place on a Listener's Cognitive Resources", Proc. 3rd Austal. Int'l Conf. Speech Science & Technology, 1990.
Hafiz Tariq R.
Michaelson Peter L.
Nynex Science & Technology
Straub Michael P.
LandOfFree
Methods for controlling the generation of speech from text repre does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Methods for controlling the generation of speech from text repre, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Methods for controlling the generation of speech from text repre will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2298946