Patent
1996-03-01
1997-07-29
Hafix, Tariq R.
395 267, 395 275, 395 276, G10L 502, G10L 900
Patent
active
056528280
ABSTRACT:
Improved automated synthesis of human audible speech from text is disclosed. Performance enhancement of the underlying text comprehensibility is obtained through prosodic treatment of the synthesized material, improved speaking rate treatment, and improved methods of spelling words or terms for the sysstem user. Prosodic shaping of text sequences appropriate for the discourse in large groupings of text segments, with prosodic boundaries developed to indicate conceptual units within the text groupings, is implemented in a preferred embodiment.
REFERENCES:
patent: 3704345 (1972-11-01), Coker et al.
patent: 4470150 (1984-09-01), Ostrowski
patent: 4685135 (1987-08-01), Lin et al.
patent: 4689817 (1987-08-01), Kroon
patent: 4692941 (1987-09-01), Jack et al.
patent: 4695962 (1987-09-01), Goudie
patent: 4783810 (1988-11-01), Kroon
patent: 4783811 (1988-11-01), Fisher et al.
patent: 4829580 (1989-05-01), Church
patent: 4831654 (1989-05-01), Dick
patent: 4896359 (1990-01-01), Yamamoto
patent: 4907279 (1990-03-01), Higuchi et al.
patent: 4908867 (1990-03-01), Silverman
patent: 4964167 (1990-10-01), Kunizawa et al.
patent: 4979216 (1990-12-01), Malsheen et al.
patent: 5040218 (1991-08-01), Vitale et al.
patent: 5212731 (1993-05-01), Zimmermann
patent: 5384893 (1995-01-01), Hutchins
Sagisaka, "Speech synthesis from text"; IEEE communications magazine, pp. 35-41 vol. 28 iss. 1, Jan. 1990.
Fitzpatrick et al, "Parsing for prosody: what a text-to-speech system needs from syntax", pp. 188-194, 27-31 Mar. 1989.
Moulines et al, "A real-time French text-to-speech system generating high-quality synthetic speech"; ICASSP 90, pp. 309-312 vol. 1, 3-6 Apr. 1990.
Willemse et al, "Context free wild card parsing in a text-to-speech system"; ICASSP 91, pp. 757-760 vol. 2, 14-17 May 1991.
"Assigning Intonational Features in Synthesized Spoken Directions", James Raymond Davis and Julia Hirschberg; 26th Annual Mtg of Assoc. Computational Lingustics; 1988 pp. 187-193.
"The Intonational Structuring of Discourse", Julia Hirschberg and Janet Pierrehumbert; Association of Computational Linguistics; 1986 (ACL-86).
"Synthesis by Rule of Prosodic Features in Word Concatenation Synthesis", J. S. Young, F. Fallside; Int. Journal Man-Machine Studies (1980) V12, pp. 241-258.
"Speech Timing and Intelligibility", A.W.F. Huggins; Attention and Performance VII; Hillsdale, N.J.: Erlbaum 1978.
"Speech Synthesis from Concept: A Method for Speech Output From Information Systems", S.J. Young and F. Fallside; J. Acoust. Soc. Am. 66(3), Sep. 1979, pp. 685-695.
"Perception of Synthetic Speech Produced Automatically by Rule: Intelligibility of Eight Text-to-Speech Systems"; B. G. Green, J. S. Logan, D. B. Pisoni; Behavior Research Methods, Instruments, & Computers, V18, pp. 100-107, 1986.
"Perceptiual Evaluation of DECtalk: A Final report on Version 1.8*"; B. G. Greene, L. M. Manous, D. B. Pisoni; Research on Speech Perception Progress Report No. 10; Bloomington IN. Speech Research Laboratory, Indiana University (1984).
"Evaluating Synthesizer Performance: Is Segmental Intelligibility Enough"; K. Silverman, S. Basson, S. Levas, International Conf. on Spoken language Processing, 1990.
Kim E. A. Silverman, Doctoral Thesis: "The Structure and Processing of Fundamental Frequency Contours", University of Cambridge (UK) 1987.
"From Text to Speech:: The MIT talk System", J. Allen, M. S. Hunnicutt and D. Klatt, Cambridge University Press (1987).
"Evaluating the Overall Comprehensibility of Speech Synthesizers", T. Boogaart, K. Silverman; Proc. Int'l Conf. on Spoken Language Processing (1990).
"On Evaluating Synthetic Speech: What Load Does It Place on a Listener's Cognitive Resources", Proc. 3rd Austal. Int'l Conf. Speech Science & Technology (1990) K. Silverman, S. Basson, S. Levas.
"Human Factors and Synthetic Speech"; J. C. Thomas and M. B. Rosson; Human Computer Interaction--INTERACT '84; North Holland Elsevier Science Publishers (1984) pp. 219-224.
Hafix Tariq R.
Michaelson Peter L.
NYNEX Science & Technology, Inc.
Straub Michael P.
LandOfFree
Automated voice synthesis employing enhanced prosodic treatment does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Automated voice synthesis employing enhanced prosodic treatment , we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Automated voice synthesis employing enhanced prosodic treatment will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-638778